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THE KATDRB GP THE PZtOBLEtl AI^ THE 6ENEBAL 
RESEARCH OBJECTIVES 



This research arose out of the concern of one school district 
about the relative costs of varying sizes of high schools* The 
ic3aediate probleo ms with dollar costs: would it be aore economical, 
in terns of construction and oaintenance costs per pupil, to build 
one large school or two smaller <mes* It ms apparent almost at once 
that other costs of size are incurred* These other or '*non«dollar'‘ 
costs are less obvious but no less serious in that they taay deplete 
our human resources in mys economic efficiency cannot overcome* The 
non*dollar costs concern the quality of life and education in a school 
and the educational productivity of that school in terms of the general 
achievement of the students. 

On one hand, increased school size means increased population 
density and organizational complexity. It seems logical, for example, 
that higher concentrations of pupils within the confined setting of 
a school building would result in higher energy drains and that the 
complex organization of a large school would alter the personal sense 
of worth of individuals and emphasise instead the worth of the organ- 
ization* The noo-dollar costs muld be reflected in such inmediate 
mys as reduced productivity in tackling and mastering learning tasks 
and in such ultimate mys as social or psychological maladjustment* 

(hi the other hand, increased size means diversity of educational and 
social activity* For example, there m^ be more varied course work 
offered in a large school and there may be more possibility for meeting 
new and different people* The non-dollar value would then be reflected 
in an increase of actual and potential fulfillment* 

It seems, however, that the econooiic consequences of our acts and 
decisions in designing schools evoke instant interest, but the non- 
econooiic consequences are felt to be somewhat innocuous and therefore 
less disturbing* Thus the study of the non-economic consequences— of 
their causes and of their effects— has, in general, been neglected* He 
are left with few mys of being mmed of the consequences, with fewer 
clues to their onset, and with a nagging uncertainty about the level 
of educational quality which can reasonably be expected with varying 
hi^ school size* 

These considerations mi^t have been generalized to all aspects 
of the learning environment, but high school size ms clearly a decision 
frequently made and changed with little regard for anything except number 
of pupils, square or cubic footage of space, and financial resources* 

The research reported herein ms designed, then, to relate size of 
secondary school to the non-dollar costs of education carried out within 
the environment established by size* Specifically, the research was 
directed tomrd the following four objectives: 



1. To evolve a substantive theoretical fraoework in which 
empirical investigation of the influence of size on the 
interrelationships of cociaunity, school* and educational 
output can be carried out; 

2. To develop and implement a methodological and multivariate 
statistical scheme in which the framework may be represented 
and applied to data; 

3. To seek, obtain, and organize the relevant data which are 

available in machine-readable form, for two states (Iowa 
and Florida); and 

4. To esamine the utility of theM kinds of data^ of this kind 
of methodology, and of the th&retical framework in arriving 
at educationally meaningful results* 

The ultimate goal or outgrowth of this research would be a set of 
substantive relationships useful in gauging the influence that the size 
of an educational plant has on educational productivity: a local school 
board should throu^ some algorithm be able to estimate the educational 
consequences of its actions and decisions. This research was intended 
to test an approach to achieving that goal. 

In Section A of this chapter of the report, an initial conceptual- 
ization of the relationships between school size and community 
characteristics,, educational processes, and educational productivity 
is presented. In Section B, the methodological approach to measuring 
theoretically-defined categories Is described. In Section C, the kinds 
of data available and the problems in using them are outlined. 



A. School Size and the Educational Process 



The research reported here was not experimental, for it would have 
been impossible to have performed a true experiment testing the effect 
of school size: it would have been impossible to have selected school 

areas randomly, built schools of systematically different sizes, and 
measured the output productivities. Instead, this research consisted 
of a survey of the current educational situations in a large number of 
communities and a comparison of the measurable community, school, and 
productivity characteristics with the actual school sizes. 

The great problem with surveys, as opposed to proper experiments, 
is in ascertaining which of the derived relationships can reasonably be 
considered causal. There must be strong theoretical justification for 
any inference made. Suppose, for exaiiple, that m ea n occupational 
aspiration level were correlated with school size: this is unenli^tening 

unless there is strong theoretical justification for size influencing 
occupational choice, e.g., by acquainting students with suitable 
occi^tional choices. In the complex data considered here, there are 
hundreds of such relationships; some are causal and others are not. A 
theoretical framework is necessary in accessing the educational 
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signiflcaace of the results of analysis* It Is also essential to the 
systematic design of the analysis; i*e*, there oust be theoretical 
guidelines for deciding what varihbles should be construe: '.ed and how 
they should be compared. This is especially true when* in the present 
research* there is a large range of possible variables and analyses. 

For example* because there were so many data items* some item composition 
had to be performed* and a theoretical orientation was necessary in 
deciding which items should be composited. 

The metatheoretical notions for the research— that is* the goals 
and format used in generating the theory— are presented in this 
section. The details and formal definitions of the theory as finally 
evolved are given in Chapter 5. It should also be noted that one 
reason for performing the auxiliary survey of U.S. educational climate* 
presented in Chapter 3* was to provide a small example of the meta- 
theoretical techniques which were later used In the main analysis of 
high school education in Iowa. 

The first metatheoretical concept is that of a cluster. Clusters 
are quelities or processes which are considered to interact with one 
another in a primary causal way. For example* the process of 
''stimulation by conversation with peers about current events" may cause 
the quality '^litical awareness" to be acquired. On the other hand* 
a quality such as "color of classroom walls" is not the primary cause 
of anything important except perhaps through such intervening causes as 
"state of being". To say that a cluster* a process or quality exists 
is not to say that it relates to a particular variable or set of var- 
iables which one has in mind* for* as noted later* there may be no 
direct measurement of a cluster. But a metatheoretical rule is that a 
cluster should be essentially a scale* and that a school should have a 
large or small amount of it* 

There are in the complex content domain associated with the problem 
of high school size a multitude of primary causes and hence of clusters. 
It is necessary therefore to organize further. A related group of 
clusters is said to be a supercluster . Because this research is con- 
cerned with assigning causes* the organization of the clusters has been 
along the lines of the agents and recipients of causes. For example* 
all educational and social processes that take place in a school* 
which is an agent of mediation* are considered together in a super- 
cluster. The primary processes are grouped so that cause* in the sense 
of responsibility* can be assessed. 

The superclusters adopted tor this research are diagrammed in 
Figure 1-A-l. The details are given In Chapter 5. As mentioned* the 
set of processes occurring within a school is considered to form a 
supercluster* and in the diagram this is labelled "School Hediation". 

The of a school is tr.^ated as a special separate supercluster in 
the sr.urzne: i.i: is !.rbelle< "School Manlrsulation"* which is intended 

to tfc'c size., alon<; with other p'lSsiMe obiocts of sf.uOy* can 

be !carJ.<;uIateu by ad'^.lnist . itors in devising a school environiLent. 

While school size does not directly affect the students of a school* it 
helps determine the more fundamental processes which do* namely* 
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Figure 1-A-l 

The Superclusten of the Theory 



"School liediation". The students of a school are represented by 
"Chomunity Input" and "Student Output" superclusters; i«e«, a school 
receives a student body with certain input characteristics, determined 
by the comnunity , and after mediation yields the output student body. 

This may be conceived as involving a transformation of the values of 
the same clusters. The additional line connecting "Community Input 
and "Student Output" indicates the continuing influence of the community 
over the growth of a student while he is in school. 

In order to establish sufficiently realistic descriptive complexity 
of the educational situation, it is necessary to define many clusters 
even within a single supercluster. Therefore, in order to provide 
adequate parsimony, a further netatheoretical notion is introduced— that 
of factors . A factor is a pattern of clusters within a aupercluster 
which in actual data is found to be constant across schools; i.e., while 
clusters represent single theoretical causal qualities and processes, 
factors provide a reduction in the consideration of them. This includes 
consideration of their total variation but is simpler to use. For 
example, if two processes, say "stimulation in advanced mathemetics 
and "stimulation in advanced physics" are found always to occur together, 
it is reasonable and does not imply a loss of information to reduce 

to a single factor such as "stimulation in advanced science . "he 
differential relationships of a set of clusters can be much more complex, 
but the general idea is that of empirical reduction. 

To summariae, the theory is developed in order to provide justifica- 
tion for assuming causality in the derived relationships and for guiding 
the analytic procedures. In the metatheory, there are clusters, corres- 
ponding to qualities and processes of primary cause, grouped into 
super clusters , corresponding to agents or recipients of cause. And within 
each supercluster the array of clusters is reduced to factors, representing 
empirically-found patterns of cluster variation. In analyzing and inter- 
preting according to the theory, comparisons are made between the factors 
of one supercluster and the factors of another, while controlling for 
the effects of factors of still other superclusters. 



3, Measuring the Effect of School Size 



Given the theoretical framework outlined in the previous section, 
the measuretiient problems are threefolds to obtain variables corresponding 
to the clusters, to combine those variables into factors, and to compare 
the factors. Because of the restriction to data already available, the 
problem of determining variables for a cluster could not be solved by 
ascertaining the most appropriate test in terms of face validity. 

Instead, i«^dieatora of the clusters had td be found. An indicator is 
a data item which is expected to correlate with a cluster but which is 
not necessarily equivalent in definition or even causation with the 
cluster. The values of an indicator are merely required to be associated 
with the hypothetical values achieved for a cluster. For example, in 
measuring the cluster "intellectual satisfaction", indicators might be 
"correlation between aptitude tests and grades" (when there is a low 
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correlaCioa, sotie students are frustrated in the learning process) anJ 
■‘library usage'* (high library usage oay aean students enjoy readi.ig). 

The assignment of indicators to clusters must be justified in terms of 
the theoretical orientation. 

Because an indicator for a cluster is not in general a direct 
Lieasure of the cluster ^ it usually contains spurious variance, i«e«, 
the variance of an indicator may be considered in three parts: cluster 

variance, wiich corresponds to measurement of the cluster; error var* 
lance, which corresponds to imprecise determination of the indicator; 
and indicator variance, which represents the substantive difference 
between the actual indicator and the ideal cluster. The amount of 
indicator variance present in the measurement of an indicator must be 
minimized, for it may not only lower the precision of the later analysis, 
just as the error variance does, but it may also substantively bias the 
later analysis, for to the extent that indicator variance is present, 
the variable constructed for a cluster does not mean what the cluster 
does. The approach taken to lower the indicator variance involves 
determining several indicators for each cluster and compositing them 
into a single score. The idea is that when several indicators are 
combined, their coiamon parts are reinforced and their indicator parts 
are cancelled. 

The method of composition used in this research was principal 
component analysis (Hotelling, 19 Si). Given a set of items, the prin- 
cipal component is the linear combination of them which has the property 
that the sum of squares of the correlations of the item with the 
combination score is maxiraized, which is to say that the maximum amount 
of variance in the items is extracted. When the items are relatively 
homogeneous— when they have high intercorrelations— the amount of variance 
extracted by the principal component is relatively high. It should be 
noted that only the unique principal component is used; the analysis is 
not used here to factor the clusters. 

Given a set of principal component scores which represent the 
clusters, the next problem is to combine them into factors within super- 
clusters. For this purpose image analysis (Guttman, 1953) was used. 

Image analysis is a hybrid of components analysis and pure factor 
analysis. It involves an implicit transformation of the original variables 
into a set of image variables— each original variable is replaced by its 
linear regression prediction from the other variables— and then a compo- 
nents analysis is made of the transformed variables. The transformation 
provides variables which approximate the common variables of pure factor 
analysis, but the subsequent components analysis allows exact computation 
of factor scores. In this research, a modification (Harris, 1962) of 
Guttman 's original formulation of image analysis is used. This modifi- 
cation sharpens the correspondence to pure factor analysis. Because 
the image factors are to be given Interpretation in the later regression, 
they are rotated to simple structure. The noimal varimax orthogonal 
rotation scheme (Kaiser, 1958) is used. 

Two recent studies of school districts have profitably used image 
analysis. Peterson, Rossmiller, North, and Wakefield (1963) studied 
the relationships araon^ 161 Indicators of ±h^ financial status of school 
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districts In Wisconsin. The Indicators used Included, for example, 
assessed valuation per capita, mean family Income, pupils per teacher, 
and the percent of the labor force in various categories. Twenty gen- 
eral factors were Identified: twelve of these characterized fiiMnclal 

ability of a district i^lle eight referred to financial need. Ixlller, 
Conry, Wiley, and Wolfe (1557) studied the interrelationships among 
characteristics of elementary school districts in Wisconsin. These 
characteristics Included nineteen demographic Indicators such as number 
of elementary students enrolled, nuiaber of one-room schools in the 
district, and valuation per school. Twelve other characteristics were 
Indices of teacher qualities, such as the cnean and variance of teacher 
salaries, local and total experience, and highest credential held. 

The relationships between the superclusters— input, mediation, 
output, and size— are examined with multiple regression analysis (see, 
e.g. , Goldberger, 1553) on the factors across the super clusters. Two 
special regression techniques are used. First, the specific contribution 
of each Independent factor Is considered separately. This is possible 
because the factors within a supercluster are orthogonal; l.e., one may 
determine how much variance each factor accounts for without worrying 
about correlation of effects. Second, a regression technique not 
usually found In educational research Is appropriate here; this Is 
sometimes called "two-stage least-squares" and Involves computing 
regression equations between two sets of variables after having controlled 
or partlalled out the effect of a third set. For exaoq>le. In regressing 
School Iledlatlon on Student Output, the Community Input factors may be 
partlalled. 

Multiple regression analysis has been employed by Thomas (1552) In 
consideration of variables selected from Project TALENT data. Thomas 
Investigated the relationships among sixteen Indicators of the socio- 
economic class of communities and sixteen Indicators describing qualities 
of teacher staff. He found, for example, that beginning salaries and 
teaching experience were significantly related to student achievement 
and that the age of a school Is positively related to student test 
scores. 

The general flow of the analysis and measurement Is diagrammed In 
Figure 1-B-l. The ultimate regression study is neither a single 
hypothesis test nor a simple slope determination: the analysis throughout 

Is multivariate, and the effect of school size is considered to involve 
complex Interaction of Input and medlatlonal factors with output factors. 
In that respect, the actual computation presented In this report in 
Chapter 5 represents only the beginning of analysis of the present data. 

The theoretical framework— cluster, supercluster, factor— and the 
methodological framework— Indicator, composite, factor— for this research 
are rather original. Ordinarily one finds studies In the literature of 
schools and social area analysis in which raw variables, more or less 
carefully picked, are regressed. Less often, the variables are factored 
and the factors are compared. Several objections to such approaches led 
to the development of the present study. 
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Figure 1-B-l 
Data Reduction 
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The importance and meaningfulness of almost any particular raw 
demographic or educational data item can be questioned. This is not a 
matter of error; it is a matter of face validity# For example, the 
wealth of a community is not really the same as the median income of 
its inhabitants; saying that a community is wealthy means both more and 
less than saying the inhabitants have a high median income; saying a 
school has good teachers is not the same as saying the average creden- 
tial of the teachers is high. In analyzing demographic and school 
data, it is essential to remember that the raw items are indicators, 
not essential causal variables# Thus, in the present research, 
compositing of indicators was performed to achieve a higher degree of 
meaningfulness • 

Factorization is no help in bringing meaningfulness to a set of 
variables. A factor is computed when several items are associated in 
the values that they achieve over the input sample# There is no 
assurance that such association is causal or meaningful# For example, 
schools with advanced mathematics courses usually have advanced physics 
courses, but having an advanced mathematics course does not imply that 
anyone is going to learn advanced physics# Factorization may even be 
dangerous in analyzing demographic and school variables, because the 
resulting factors are often neat and satisfying. This is especially 
true when a large mass of variables is reduced to a relatively small 
number of factors. The output from a factor analysis can be no more 
meaningful than the input, and unless variables are chosen to measure 
significant theoretical processes and qualities, the outputs are useless 
in providing an explanation for the causal situation. For example, 

\rtien two highly correlating variables are included in an analysis they 
will almost always form a factor, yet they may be related to entirely 
different theoretical constructs. In this research, the compositing 
operations were intended to provide meaningful variables for analysis. 






C. 



Obtaining and Manipulating Data 



The research was confined to the use of data already available in 
machine-readable form. It therefore became, in part, a test of whether 
such data are adequate for obtaining substantive results expected of 
exacting research inquiries# The two major sources of data were; the 
U.S, Census Bureau for socio-economic data, and state departments of 
education for educational data. In both instances, the data were 
originally gathered and stored for other purposes. Of the five kinds 
of data sought, three were available in the Iowa and Florida educational 
data banks— namely, school, school district, and teacher data. The 
Iowa data bank also had complete data on students, idiile Florida had 
data on only a sample of the students. The Census Bureau data bank had 
data on all areas in both Iowa and Florida. 

A preliminary problem in using the data banks was to document the 
data items available for all entities and to determine the format of the 
data. In the preliminary stages of the research, a complete content 
analysis of the data bank manuals was made and a computer-made index 
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v/as prepared. This process took a surprisingly large auount of time and 
effort. A aajor concern in begirning the analysis of the data was with 
the alignment of different data units. For example, students are units 
within schools, and school attendance areas are units which may differ 
from census tracts. Procedures had to be worked out for dealing with 
such hierarchical and dlsconsonant units. 

The problem of hierarchical units is a very difficult one statis- 
tically. Nb adequate statistical technique was found which provided 
for comparing data at different levels In a hierarchy. How can teacher 
quality be compared adequately with student performance? The problem 
Is really one of relating the higher unit's variables to the distribution 
of the lower unit's variables, o.g., teacher quality Is related to the 
distribution of student performance. The simplest way around this 
difficulty is to compare some computed characteristic of the distributions 
at the lower level with the variables at the higher level, e.g., comparing 
teacher quality with mean performance. This and an associated technique 
V7ere used in this study. The associated technique consisted of taking 
an additional characteristic— namely, the variance of the distribution 
at the lower levelM»and comparing it. 

The problem of dlsconsonance Is very difficult practically. This 
problem arose in Iowa when the school areas were to be compared with the 
census units. There is essentially no correspondence. For the purpose 
of this research all data had to be made. First, methods for approxl- 
Listing the correspondence with statistical adjustment were considered. 
Finally, the exhausting job of comparing maps and assigning census units 
to school districts proportionally by area had to be performed. 

The problems of making correspondence between hierarchical and 
dlsconsonant units Is diagrammed in Figure 1-C-l. The substantive 
validity of using such data banks is discussed in Chapter 6. It Is 
appropriate to note here their use for purposes other than originally 
intended creates difficulties not easily foreseen. 
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Figure 1-C-l 

Hierarchical and Discoraonont Units 
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CHAPTER 2 



FESSPECTIVES ON THE RESEARCH PROVIDED BY PREVIOUS 

INVESTIGATORS 



In order to provide a perspective for the present research on high 
school size as related to educational productivity and to identify 
substantive constructs and hypotheses relevant to the topic, a survey of 
pertinent literature was undertaken. The findings that previous investi* 
gators have found to be related to organizational size are reported in 
Section A of this chapter^along with a tabulation of the variables. In 
Section B, a resuoe of research directly related to school size is 
presented. As was explained in Chapter 1, school size is one link in the 
causal chain of the educational process. Related to this is the fact 
that while cocnunity characteristics do exert an important effect upon 
students, the effect of school size is differential across types of 
^coomunities. Therefore, in Section C, a survey of the literature on 
social area analysis and a tabulation of the variables which previous 
researchers have found relevant to this topic are presented. Finally, 
in Section D, there appears an abbreviated transcript of a colloquiua on 
the Determination of Sdiool-Comaunity Characteristics which was held in 
January, 19G7. As the transcript indicates, the meeting was initially 
designed to focus on the problems of measuring the community character- 
istics of school areas. However, the discussion proceeded to probe into 
the problems and relevance of planned measurement as it ccmcemed this 
specific research undertaking. 



A. Organizational Size 



A review of the literature pertaining to organization or group size 
was made. Initial sources were obtained from the extensive bibliogra^y 
accompanying Volume 1 of The lianaging c>f Organizations by Bertram (bross. 
Other articles were found by perusing the Education Index . Psychological 
Abstracts. Sociological Abstracts , the Science Citation I^ex. the Review 
of Educational Research, and the Reader's Guide to Periodical Literature . 

The general paucity of articles pertaining to size as a characteristic 
of organization is illustrated by the fact that only eight articles 
appeared in the Sociological Abstracts between 1953 and 1963. Similarly, 
there were only six pertinent articles cited in Psychological Abstracts 
between 1955 and 1965. 

Cross references from the various articles relating to organizational 
and group size were explored. IWenty-eight were found which were directly 
related to organizational size, while another twenty were extensively 
reviewed. In addition, a tabulation of the variables related to organ- 
izational size is provided below: 



1) Absence (Revans, Indik-C) 

2) Accident rate (Revans) 

3) Afoount of delegation to subordinates (Hanson) 

A) Auount of executive responsibility (Hanson) 

5) Attitude (Talacchl, Indlk-C) 

6) CoasBinlcatioa problems (Caplow) 

7) Compensation (Simon) 

G) Creativity {GlhlfA 

9) Depersonalization (Burling) 

10) Effectiveness (Idkert) 

11) Efficiency of task performance (Likert) 

12) Employee morale (Worthy) 

13) Ethical commitments (Thomas) 

14) Frequency of succession (Gruslq^) 

15) Group stability (Caplow) 

Id) Individual behavior (Talacchl) 

17) Input (fierbst) 

18) Leadership perceptions Oiedalln) 

19) llanagement structure (Revans) 

20) llend>er participation (Warner, Indlk-A) 

21) itorale (Revans) 

22) Operating efficiency (Worthy) 

23) Organizational environment (Gross) 

24) Organizational purpose (Gross) 

25) Output (Herbst, Harriott, Revans, Indik-C) 

26) Problem-solving ability (Gibbs) 

27) Proportion of resources devoted to self -maintenance (Caplow) 

28) Punctuality (Revans) 

29) Role conception (Thcmias) 

30) IU>le constfisus (Thomas) 

31) Size of administrative component (Anderson, Terriens) 

32) Sociological effects on group (Slmmel) 

33) Strikes of employees (Revans) 

34) Supervision ratio (Revans, Indlk-B) 

35) Tenure of leadership (Krlesberg) 

36) Turnover (Indik-C) 

37) Uniformity of organizational design (Caplow) 

38) Work performance (Thomas) 

Xhe literature on organizational size can be grouped Into three 
categories: (1) the effects of organizational size on employees, 

(2) the relationship between organizational size and organizational 
output, and (3) the effect of organizational size on the structure of an 

organization* 

In the first category, the size of an organization has been found 
to have varied effects on employees* In a study of industrial statistics. 
Revans (1959) found evidence of lowered interest in one's job among 
employees as organizational size increased. The lower morale was reflected 
by evidence of greater absenteeism, an increased accident rate, a lower 
rate of punctuality, and a greater number of employee strikes as organ- 
izational size increased. In another study, Talacchl (1960) who studied 
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British miners, discovered that the rate of subscription among miners to 
a mining magazine was consistently lower among the workers in the larger 
mines; the salient conclusion in this research was that the dissatisfaction 
of workers in the larger mines was significantly correlated with the 
lower subscription rate to the mining magazine in such mines* In another 
experiment recently reported by Argyris (1S59) it was found that by pur- 
suing a formula of ''optimal undermanning”, branch banks were able to 
increase worker productivity, cohesiveness, morale, and cooperation* 

In a study done Worthy (1950) Sears*Soebuck eoiployees declined in 
morale as the size of administrative units and the complexity of the 
organization increased. 

Other studies are also pertinent to the effect of organizational size 
upon employees. Simon discovered that compensation was a factor positively 
and pri^rily related to organizational size. 1-Iedalia (1954) in a study 
of Air Force squadrons found that the squadron leaders ware seen as less 
human persons as the size of the squadron unit increased. In a study 
investigating the relationship between membership participation and the 
size of an organization, WSmer (1964) and Indik (1961) found that the 
size of an organization was not inherent to membership participation but 
rather that a cluster of factors is related to this enq>loyee behavior. 
Finally, in a study of various groups of welfare workers, Thomas (1959) 
found t^t smaller groups had persons with higher ethical commitments, 
greater breadth of role conception, and greater role consensus than did 
larger groups. 

Several studies were found linking organizational size to organiza- 
tional output. Likert (1961) found that group effectiveness declined 
above an ascertained optimum group size. In another study. Worthy (1950) 
discovered that with the increasing size of an administrative component 
of an organization, worker efficiency decreased. In a study of automobile 
factory workers, Harriott (1949) found that worker output increased in 
smaller groups. A higher quality of worker performance was correlated 
positively with smaller work groups in the study of welfare workers made 
by Thomas (1959) while a study of college students made by Gibbs (1951) 
revealed that creativity and problemsolving abilities among students were 
enhanced by small group organization. 

In the review of research regarding the relationship between organ- 
izational size and organizational structure, Hanson (1964) found that 
there was a tendency to delegate more responsibility in larger groups. 
However, Caplow (1957), Burling (1956), and Argyris (1959) have also 
found that larger groups are characterized by having greater communication 
problems and by being more depersonalized. In another study, Kriesberg 
(1962) found that larger organizations were characterized by a decreased 
tenure of leadership while Grusky (1962) found that in such organizations 
there existed a greater frequency of succession in leadership positions. 

In a study investigating the relationship between the proportion of 
resources devoted to self -maintenance and organizational size, Caplow (1957) 
discovered that the proportion of resources allocated to this function 
increased as the size of the organization grew. A study by Terriens 
revealed that as organizations increased in size, the administrative 
component also enlarged. This study is balanced, however, by a study 
made by Anderson (1961) which showed that the administrative component 
was proportionately less in larger organizations idien other factors such 
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as Incraased ntunber of tasks to be perfomed or the increased ninaber of 
plants in the organization were excluded* 

Finally, it is significant to note that there is not a well-defined 
body of literature on organizational size as it relates to educational 
enterprises* There are, however, two recent studies whose findings 
deserve citation. In a study of organizational decentralization in the 
school systems of I-iontgomery County, Maryland and Atlanta, Georgia, 

King (1966) discovered that system decentralization contributed to Improved 
teaching conditions, better cociaunication, and an improved instructional 
program for students. Also pertinent is the report of the recent study 
of the New York City School System by Gittel (1967) and Hollander (1964)* 
These investigators maintain that their research data reveal that the 
New York City School System is virtually paralyzed If its large and 
ponderous bureaucratic structure. Furthermore, the study asserts that 
this mammoth organization has produced inertia and hindered the school 
system from responding effectively to new demands. Evidence for this 
conclusion is provided by data revealing that the N^ York City System 
has not made any meaningful changes for thirty years in curriculum, 
administrative structure, teacher recruitment, appointment procedures, 
or training. The investigators suggest that organizational decentralization 
of the system mi^t contribute to the facilitation of greater organizational 
effectiveness* 



As mentioned above, much of the research which has been done on 
the influences of school size relates size factors to dollar costs* ^ The 
resulting evidence has been useful where costs alone are under consider- 
ation* Since those studies are not of direct concern in this research, 
they have not been presented below. The research reported below was 
found by searching such sources as Dissertatiwi Abstracts , Education 
Index , Encyclopedia of Educational Research. Review of Educational 
Research » and Journal of Educational Research . 

Attempts have been made recently to relate certain factors of size, 
especially enrollment, to indices of <iuality. Quality has been defined 
in different ways, usually in quantitative terms such as breadth of 
course offerings, teaching loads, or special services* Some of this 
research has produced positive correlations between quality, defined in 
those ways, and size. California (Smith, 1964), North Carolina (Weaver, 
1961), Florida (Dungan, 1961), Alabama (Vardman, Crocker, 1958-60), 
Mississippi (Osborn, 1962), and Missouri (Savillm, 1961) have been the 
loci of studies substan':iating the relationship. A number of persons 
report a positive correlation up to specific points in size; e.g., 1200 
students. Beyond this poi:it a plateau is soon reached and negative 
correlations appear. With f-.irther increases in size the benefits did not 
grow, and the disadvantages of inoersonality and lack of contact became 
evident* Barker and Gump (1964) report that a negative relationship 
exists between institutional size and individual participation* 
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Additional studies show that other factors, such as state of residence, 
parental educational background (Bryant, 1957), social adjustment (Cluff, 
1954), and teacher approval (Christensen, I960), have more effect on the 
selected laeasures of quality than does size. The advantages that are 
cited in the literature regarding secondary schools above 700 enrolluent 
are usually quite specific and may be sumoarlzed as follows: 

(1) Better faculty 

qualifications 

compensation 

stability 

(2) Better conditions 

more efficient administration 
lighter teaching loads 
more resource persons 
lower teacher-pupil ratio 
more materials and equipment 

(3) Better programs 

laore opportunity to experiment 
more courses 

more extra-curricular activities 



The disadvantages cited are usually less specific and more difficult 
to document: (1) less personal contact between students and teachers; 

(2) questionable psychological effects on students; (3) lower participa- 
tion In extra-curricular activities; and (4) weaker staff communication, 
liost such studies are of schools having fewer than 1000 students. 

Information from studies of this kind have been useful In seeking 
variables to be Included in the present research. In the face of the 
general feeling that large schools may be detrimental in some ways, changes 
have been proposed which are Intended to remedy the situation, '^chools- 
wlthln-a-school" (Smith, 1961) or ’'vertical units” (Evans, 1960) with 
independent faculties and student bodies have been established to try to 
gain the advantages that come wit ^ ^Ize, but, obviate the disadvantages 
that are noted after the "optimum' oize has been reached. See Figure 2-B-l. 

An analysis was made by Gaumnltz and Tompkins (1950) of survival in 
high school based on the U.S.O.B. Biennial Survey of 1945-43. The 
results are in part a function of the variations among states in statutes 
regarding compulsory attendance. Their findings are as follows: 

(1) Hi^ schools are not holding over 50X of pupils through to 
graduation. 

(2) The range from highest to lowest states Is from 2:1 to 3:1. 

(3) Separate school systems for minority groups Invariably show 
lower holding power. 
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Variable 



Source Document 



1 ) 

2 ) 

3) 

4 ) 



5) 

S) 

7) 

3) 

9) 

10 ) 

11 ) 

12 ) 

13) 

14 ) 

15) 

16) 

17) 

18) 

19) 

20 ) 
21 ) 
22 ) 
23) 



Achievetient 

Adulnis trat ion 

Cocaounlcatlon probleos 
Curriculum 



Extracurricular activities 
Faculty qualifications 



Bragg, Gray, Jantze, Lamberty, McDaniel, 
Smith, H. J., Vardeoan, Weaver 
Dungan, Grady, Garcia, Shelley, Smith, 

H. J., Watson 
Evans, Shapiro 

Andrews, Barnard, Berg, Bragg, Cracker, 
Dungan, Garcia, Kemps ton, MacArthur, 
McDaniel, Osburn, Rougean, Savllla, Smith, 
C., Smith, H. J., Sollars, Treadway, 
Vardeman, Woods 

Andrews, Garcia, Grady, Gray, McDaniel, 
Weaver, Woods 

Barnard, Collingsworth, Cracker, Rougean, 



Faculty stability 
Faculty training 
Group cooperation 
Guidance 

Materials for Instruction 
Morale 

Personal relations 
Program 

Pupil-personnel ratio 
Pupil-teacher relationship 
Range of courses offered 
School-community relations 
Special Instruction 

(ability grouping classes) 



Vardeman 

Gray 

Weaver 

Shapiro 

Dungan, Gray, Rougean, Smith, H.J., Weaver 
Smith, C., Woods 
Evans, Zlnser, Gaumnltz 
Smith, H. J. 

Jackson, MacArthur, Shelley, Treadway 

Garcia 

Tyson 

Barnard 

Andrews 

Savllla, Weaver 



Student characteristics 
Teacher characteristics 
Teacher effectiveness 
Teaching of basic skills 



Williams 

Patterson 

Shapiro 

Loughrldge, McDaniel 



Figure 2-B-l 
Summary of 

Variables Related to High School Size 
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(4) Schools organised as Junior-Senior High Schools do better on 
lolding than do other types of organizations. 

(5) Large high schools are not holding pupils any better than 
Sinall ones. 



Barker and Gunp (1S54) suggest three alternatives to handle increas- 
ing enrollments and preserve small school advantages: (a) establish new 

small schools; (b) set up campus schools in which students are grouped 
in semi-autonomous units; (c) set up autonomous campus units. They 
state that a school should be large enough to have a comprehensive program 
and small enough so that (i) a higher percentage of its students are 
needed for its various enterprises and (11) the enterprises are not 
redundant. Yet, they conclude that their research does not answer the 
question, ‘Itow large should a school be?" The reason given is that 
crucial variables related to educational productivity were not Investigated. 



C. Social Area Analysis 



In doing the preparatory work in this study a search was made of 
reported research to determine the instruments and methods of analysis 
which would be most useful in the work. Disser tation Abstracts, Economy 
Abstracts . Sociological Abstracts , aad The Educational Ind^ were used as the 
starting point in assembling pertinent information in regard to the 
classification of conaunity characteristics. The most useful material 
found to date is presented below. 

"It is a matter of everyday observation," says Wendell Bell, "that 
metropolitan areas are subdivided into different sections, each exhibit- 
ing different features." A number of investigators have tried to capture 
the distinctive features in some kind of typology that would be useful 
in analysis. 

An approach to a typology will vary with perspective of the 
investigator. Duncan and Schnore (1959) Indicate three different 
emphases: (1) Cultural, (2) Behavioral, and (3) Ecological. The cultural 

perspective considers cultural factors as the major predictor of spatial 
clustering in the city. It emphasizes the symbolic value of space and 
is critical of the economic bias of ecology. The behavioral or inter- 
act ionist perspective tends to be concerned with the frequency, intensity 
and forms of interpersonal communication. The ecological perspective 
stresses the influences of technology, population characteristics and 
environment on social organization and spatial distribution. None of 
these emphases is sufficient to explain the complexities of urban 
organization, but each concentrates on and elucidates different facets 
of a multidimensional problem. 

Shevky and Bell (1955), in a technique called "Social Area Analysis," 
acknowledge their indebtedness to the urban ecologist. This social area 
analysis approach has been used by a number of investigators in their 
studies of metropolitan areas. The social area is dependent on spatial 
contiguity. 
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The basic unit of analysis in the construction of social areas has 
generally been the census tract. Chicago is divided into more than 
1000 of these relatively small geographical areas. Census bulletins 
contain a fund of information on the population in the tracts»»race, 
sex* marital status t type and condition of dwelling* number of persons 
per dwelling, income, type of heating, rent, etc, ••which is useful in 
analyzing the area. 

Shevky and Bell sought to organize this maze of census data. These 
data were arranged in socioeconomic, family and ethnic sets. The rationale 
for this selection is presented schematically in Figure 2^C^1, 

Tryon (1955) independently analyzed the San Francisco Bay area and 
evolved a near identical classification. Tryon employed a more extensive 
list of variables, 33 in all: eight dealing with population character^ 

istics, thirteen with occupations and twelve with dwelling units. By 
a method of cluster analysis of the 33 variables, Tryon found seven 
distinct domains: Socioeconomic achievement. Family life, Assimilation, 

Female schievement. Socioeconomic achievement 2, Assimilation 2 and 
Socioeconomic independence. 

Tryon gathered his domains under four major factors: Socioeconomic 

independence. Socioeconomic achievement. Assimilation, and Family life. 
These empirical dimensions have a validity beyond those of Shevky and 
Bell, which were the result of theory and insight. The factors are listed 
below so that the reader can note their similarity. 



Shevky and Bell 

1 . Socioeconomic Status 

(Social Rank) 

2. Ethnic Status (Segregation) 

3. Family Status (Urbanization) 



Tryon 

1. Socioeconomic Achievement 

2. Assimilation 

3. Family Life 



Tryon, after complir.enting Shevky and Bell for their insight in 
choosing on £ priori grounds what he derived empirically, questioned the 
fact that they were missing a factor that would parallel his socioeconomic 
independence. He suggested that their indices would have been improved 
by using more measures. 



Kaufman (1961) revised the ShevkyBell procedures and tested his 
new variables by analyzing the cens*«s data for the Chicago and San 
Francisco areas. He dropped rent from the socioeconomic status measure 
and retained education and occupation, but defined them differently. He 
also eliminated house type from the family status indicators because of 
its limited application. The fertility ratio and proportion of women 
not in the labor force were retained and two additional measures were 
added, one to qualify the fertility ratio and the other as a measure 
expressive of family status. The ethnic status measured was redefined as 
the proportion who are not native born whites. Kaufman still maintained 
seven measures but believed that the changes he suggested improved the 
analysis. 
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PoituUtes 
Concrrntnf 
Indiittrisl 
Sowiy (Atpecti 
of Incmsinf 
Scale) 

(«) 



Change in the 
range and 
intensity of 
relations 


/ ' 


Differentiation 




of function 




Complexity of 
organisation 


\ 



Statittici 
of Trendi 
( 2 ) 

Changing distribution 
of ikills: 



manual productive op- 
erations — growing im- 
portance of clericali su- 
pervisorjTi management 
operations 

Changing structure of pro- 
ductive activity: “ 

Lessening importance of 
primary production ~ 
growing importance of 
relations centered in 
cities— lessening impor- 
tance of the household 
as economic unit 

^Changing composition of 
population: “ 

Increasing movement — 
alteration: in age and 
MX distribution — In- 
creasing diversi^ 



Changes in the 
Structure of a 
Given Social 
System 
(3) 

Changes in the 
^ ar rangemenr* 
o f occupationa 
based on func- 
tion 



Changes in the 
^ ways of living* 
—movement of 
women into ur- 
ban occupations 
—spread of al- 
ternative family 
patterns 



Redistribution in 
• space — changes* 
in the propor- 
tion of support- 
ing and depend- 
ent population 
•isolation and 
segregation of 
groups 



Constructs 

(4) 

S^cist Rsnk 
^ (economic* 
status) 



Vfh4tni%0$hn 
^ (family * 
status) 



SegregMiUn 
^ (ethnic 
status) 



Sample 
Statistics 
(Related to the 
Constructs) 

(s) 

Years of schooling 
^Employment status * 

Class of worker 
Major occupation group 
Value of home 
Rent by dwelling unit 
Plumbing and repair 
Persons per room 
Heating and refrigera- 
tion 

Age and sex 

^Owner or tenant ' 

House structure 
Persons in household 



Race and nativi^ 
^Countiy of Urth 
Gtisenship 



Derived 
Measures 
(Fitwn 
Col. 5) 
( 6 ) 



Occupation 
^Schooling 
Rent 






Fertility 
^Wonen at 
work 
Single- 
family 
dwelling 
units 



Racial and 
^ national 
groups 
in rela- 
tive 

isolation, 



Index 

II 



Index 

III 



Figure 2-C-l 



Steps in G>nstruct Formation and Index Construction* 



*Shevky and Bell, 1959, p« 4. 
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Edward Koch (1966) used .an adaptation of the Shevky*3ell analysis 
In his study of the elementary school districts of Bin^amton, 
Kassachusetts* However, instead of dealing with census tracts directly, 
Koch converted census information to school district data. He also added 
an index, residential stability, to the three standard Shevky-Bell indexes. 
This factor, together with social rank, proved to be the most important 
factor in predicting academic achievement. He found that neither 
ethnicity nor urbanism was significantly related to academic achievement. 
Koch's contributions, then, were his method of reducing census tract 
data to school district data and his addition of a fourth factor, residen- 
tial stability, to improve the analysis. 

Most of the studies reported to date have been devoted to the 
Shevky-Bell method of social area analysis or assorted modifications of 
it. The reports which follow are distinct from this model and report 
on varying geographical areas with many more indicators than any of the 
previous writers have utilized. 



Christen Jonassen (1958) used eighty-two variables in his search 
for functional unities in the eighty-eight Ohio counties. He used factor 
analysis, as Tryon used cluster analysis, to see if the vast number of 
community variables would form independent clusters of highly interrelated 
components. He wanted to simplify the coLiplex structure of community 
interrelationships to a manageable list of factors which could account 
for the differences between community systems. 



Jonassen used factor analysis on the eighty*two indicators of Ohio 
counties and arrived at seven factors or clusters: Urbanism, Welfare, 

Influx, Poverty, Magni-Complexity, Educational Effort and Proletarianism. 
Urbanism loaded significantly on fifty-one of eighty-two county measures 
with very high loadings on clerical and sales workers, urban populations, and 
social complexity. Welfare loadings were high on efficiency, welfare, 
educational sacrifice and educational effort. Influx had high loadings 
on migration gain, population gain and population mobility> among others. 
Kagni-complexity was characterized by high loadings in population density, 
size, educational plant size, and governmental complexity. Educational 
effort is self evident; finally, proletarianism had high positive load- 
ings on unskilled workers and a high negative loading on wealth differential. 



Byron Munson (1966) at Ohio State University replicated Jonassen 's 
work in Ohio and Charles Bonjean used a majority of the same measures in 
a national <itudy. Both of these investigators should have reports 
published on their results soon. 



The emergence of these meaningful clusters, Jonassen felt, was an 
indication of the reliability of the units of measurement and of the 
basic data. The eighty-two variables used did piece together to yield 
a significant picture of systems within the established framework of 
census information. 



Roger Lennon (1952) did a study on the "Prediction of Academic 
Achievement and Intelligence from Community and School System Character- 
istics." He neatly suimnarized the studies of Thorndike, Davenport -Rummers, 
Mailer and Pierce and then went on to confirm a number of hypotheses 
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relating achievement and intelligence with a variety of community char* 
acteriatica. He inveatigated the extent to which meaaured achievement 
of the public elementary school population of a community is predictable 
from a knowledge of certain social and economic characteristics of the 
community, and certain characteristics of its school s;,Jteffl. He found 
that median schooling of the '^dult population, median monthly rent, per 
cent of adult Illiteracy, per cent of native i^ite, and per*pupil 
expenditure were consistently highly correlated with academic achievement. 
Achievement is positively associated with qualities pertaining to the 
intellectual, cultural, and economic status of the adults in the community. 

Schutz (1955, 1950) also used the factor analytic approach in two 
studies, one of educational development in the United States and the 
other of academic achievement and community characteristics. In both 
cases he was looking for functional unities which might be useful in 
further educational research. 

The 1955 study of state data was based on thirty variables* Four 
factors were extracted using Thurstone's complete centroid method. Schutz 
named the factors urbanization, administrative organization, intellectual 
climate, technological advances and lagging social institutions. 

In his 1950 study, Schutz used twenty variables. The data were 
collected and correlated by Gawkowski (1955) in his work on the use of 
community characteristics for obtaining local norms on standardized 
achievement tests. The variables included indices of the educational 
background of adults in the community, the labor situation in the com- 
munity, the racial and national origins of the inhabitants, the financial 
income and the academic achievement in the community. 

Five factors were extracted using Thurstone's complete centroid 
method: (1) Urban-Financial, (2) Intellectual Climate, (3) Economic 

Stability, (4) Academic Achievement, and (5) Low Soc^v ^economic Status* 

This analysis, based on eighty-four local communities, provides evidence 
at another geographical level of the existence of these factors. The 
fact that academic achievement loads only on two achievement variables 
suggested that high academic achievement can be obtained in widely 
differing coomunities. 

James Beshers (1955) studied census tract data and social structure. 
His study yielded four factors from the twenty items of census information 
which he used. He called his factors social status, «young faraily, 
female-stability and race. 

The work of Johnson (1958) and Kaplan (1958) was directed by 
Borgatta at New York University. Johnson searched for functional unities 
in an analysis of United States county data. He found six factors that 
accounted for 100 per cent variance. Since population size itself 
accofjinted for sixty per cent of the data variance, he decided to convert 
all data not in rate form into per capita rates and to re-analyze. Forty- 
five significant variables resulting from preliminary analysis were 
re-analyzed and five factors accounting for sixty per cent of the variance 
in the second analysis were retained. The factors are: (1) Size of 
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population, (2) Non*£aru, urban*rural monetary econoaqr*leeel of living, 

(3) Higb farm-operator family level of living index and stable demographic 
structure, (4) Age structure of the population, and (5) Residential 
stability. 

Kaplan's analysis of the sietropolitan areas of 25,000 or over also 
yielded five factors idiich accounted for seventy per cent of the variance: 
(1) Population size, (2) Socioeconomic dimension, (3) Population stability 
and growth, (4) Ethnic and racial homogeneity plus residential orienta- 
tion, and (5) Age-sex structure. 

Borgatta directed a similar study by Jeffrey Hadden at the University 
of Wisconsin (1963). Part of Hadden's dissertation was a replication of 
Kaplan's work. He then extended his investigation in an effort to 
determine whether the underlying dimensions that he and Kaplan discovered 
would obtain when other territorial units were observed. The results of 
factor analytic studies lent considerable support to the proposition that 
the major factors discovered represent stable underlying dimensions of 
urban social structure. These factors remained basically unaltered 
when different territorial definitions of cities were examined (SHSA's, 
Urbanized Areas and Cities). The principal value of this empirical 
approach was that it provided a succinct way of sumnarizing and ordering 
the relationships that exist among a large number of variables. 

Another study that may be useful in the present investigation is 
that of Allen and Bentz (1964). They were interested in developing an 
instrument to measure sociocultural change using state data. They factor 
analyzed thirty-two indicators which enabled them to assess change in 
various parts of the society and culture. Their work yielded four 
factors: (1) standard of living, (2) population growth, (3) industrial- 

technological -urban development, and (4) education. Since they were 
dealing with change all their indicators were rates, i.e., percentages. 
They rated forty-ei^t states on their indices and were satisfied with 
their ability to measure change and suggested that future studies may 
attempt some type of study of causal connections between the components. 

Three additional reports have been reviewed. One was a study by 
Wood (1961) of the public sector of the New York metropolitan region. 
Included were 22 counties and approximately 1400 governments. In the 
analysis, seven factors were extracted which were described as 
"identification tags, not complete descriptions of .he contents." They 
were composed of thirty-four variables, twenty of which were described 
as socioeconomic and fourteen as fiscal. The seven factors were: 

(1) community size, (2) industrialization, (3) housing density, (4) age, 

(5) low income prevalence, (5) residential affluence, and (7) land reserve 

In a study-by .Peterson,-Bossmiller, Wakefield, and North (1963) of 
a large number of variables associated with ability to support education 
and need for education support, 161 variables were identified and reduced 
to fifteen factors. They relate to the following: (1) personal income, 

(2) certain municipal expenditures, (3) certain municipal receipts, 

(4) agricultural land valuation, (5) size of certain hi^ school classes, 

(6) certain school expenditures, and (7) staff-pupil ratios. 
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The fiiul citation it Raiasiaan'a (1>64) critique of claaaification 
of social data. Ee analyzed five indices and their indicators. The 
five indices were: (1) population size, (2) economic function, 

(3) occupational distribution, (4) moral integration, and (5) social area 
analysis. 

The factors extracted in the fourteen investigations are presented 
below: 



1. Trvon’s Clusters (1955) 2. 

Family Life 
Assimilation #2 
Socioeconomic Independence 
Socioeconomic Achievement #2 
Assimilation #1 
Female Achievement 

3. Jonas sen (1958) 4. 

Urbanism 

Welfare 

Influx 

Poverty 

lia gni-complexity 
Educatf.onal Effort 
Proletarianism 

5. Schutz #2 (1955) 5. 

Urbenication 

Administrative Organization 
Intellectual Climate 
Technological Advances and 
Lagging Social Institutions 

7. Johnson (1958) 8. 

Population Size 
Level-of-Living 
Hi^ Farm Operator Level of 
Living 

lledian Population Age 
Residential Stability 
Commei *«.alized Farming vs. 
Subs.Jtence Farming 

9. Hadden (1963) 13. 



Bell"Shevlcv and Bell (1955-61) 

Social Rank or Socioeconomic Status 
Urbanization or Family Status 
Segregation or Ethnic Status 



Schutz — #1 (1950) 

Urban-Financial 
Intellectual Climate 
Economic Stability 
Acadonic Achievement 
Low Socio-Economic Status 



Allen and Bentz (1954) 

Population Growth 
Industrial-Technological-Urban 
Development 
Standard of Living 
Education 

Kaplan (1958) 

Population Size 
Socioeconomic Dimension 
Population Stability and Growth 
Ethnic and Racial Homogeneity 
Age-Sex Structure 



Koch (1955) 



Population Size 
Socioeconomic Status 

Hew Residents 
Age of Residents 

South vs. Non-South 
Per Gent Living in Group Quarters 
Population Density 
Per Cent of Employed Worked Outside 
the County of Residence 
Per Cent Non-White 
Population Increase 

(Nine other factors unnoaed) 



Social Rank 
Residential Stability 
Ethnicity 
Urbanisia 



11« Kaufaan (K31) 



12. Wood (1961) 



Titles 82 T.e as Bell^ShevIqr 
but Indices Differ 



Coomunlty Size 
Industrialization 
Housing Density 
Age 

Lo«*income Prevalence 
Residential Affluence 
Land Reserve 



13. Peterson » Rossmiller, 

Wakefield « and Worth ( IS S3) 



14* Retssoan (1964) 



Income 

liunicipal Revenues 
Hunicipal Sxpenditures 
Equalized Valuation 



Population Size 
Economic Function 
Occupational Distribution 
lioral Integration 
Social Area Analysis 



Colloquium on the Determination of 
School**C(niinunity Characteristics 



On January 5th and 6th, 1967, a colloquium was held at Hadison, 
Wisconsin on the determination of school-community characteristics. Its 
purpose was to seek additional guidance from special consultants with 
regard tc problons encountered in the conduct of this research. These 
problems were of three kinds: (1) the dimensions of community relevant 

in the study of school units, (2) the geographic mapping of such data, 
iind (3) estimation techniques useful in cood>ining data from hierarchical 
or disconsonant units. The recsainder of the colloquium was devoted to 
discussion and application of the ideas presented on the three problems. 



iMiiiAnn-ions of Co«wa«mitv Characteristi cs Relevant 
in the Study of School Pnits (Bonjean, 1957) 

(An abstract of a formal paper presented by Bonjean is presented 
l>elow, followed by a summary of the discussion which ensued.) 

A major problem associated with comparative analysis, including 
factor analysis, of communities results from the ambiguity of the concept 
"coocunity". Previous factor analytical studies have been limited to 
atypical sets of cocmunitles. 

The present study utilizes the county as a unit of analysis, and 
Includes variables used in previous studies. The purpose is to locate 
dimensions which explain variation and note similarities with dimensions 
identified in previous studies. The county was selected as the unit of 
analysis because it is the largest admlni6i"‘*“ative unit below the state, 
it involves both rural and urban situations, and it is as clearly defined 
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politically, socially, econonically, culturally and functionally as is tha 
city* Variables were drawn frou the U*S* Bureau of C nsus inforoation and 
previous studies* The universe of the study is a duplicate, with few 
exceptions, of the universe used in The Countv and <^tv Data Book * 

The oethod includes initial factoring by a version of Hotelling's 
principal axes oethod to reduce the orthogonal factors, and rotation by 
nonaalized varloax procedure to accoooodate the factor breadth of the 
study. In this study variables are called prloary variables of a factor 
if they have their greatest loading on the factor in question* 

Eighteen factors accounting for 78*57* of total variance were extracted. 
The eight accounting for the greatest amount of variance are the following: 

X. (accounting for 18.2Z of the total variance) The "Socioeconomic 
Status" factor, which includes variables indicating social stratification, 
has been the cx>st critical factor in previous studies as well* Variations 
among the different studies with respect to this factor are most likely 
due to the inclusion of different indicators* 

II* (11*31 of total variance) The second factor includes variables 
relating mainly to age structure, althou^ considerations of the direction 
of the loadings and interrelations among variables suggest the name "Family 
Life Cycle"* The non*occurrence of this factor in one previous s^udy 
might be attributable to the universe of that study since there is con- 
siderable overlap in variables and the units of analysis are the same* 

III* (6*1X of total variance) Factors similar to this third 
factor, 'Yjovemmental Revenues and Expenditures", do not appear in previous 
studies, probably due to the fact that most of the important variables 
clustering about this factor were not used in the earlier studies* The 
relationship of this factor to factor I is less than expected, althou^ 
its greater relationship to educational rather than wealth variables 
might have been anticipated* 

IV* (5*51 of total variance) "Residential libbility" occurs as a 
factor in previous studies and Includes similar variables* This factor 
shows some relationship to factor 1* 

V* (4*2X of total variance) The primary variables associated with 
"Urbanism" are the classical variables of size, density, and heterogeneity, 
yet the low percentage of total variance belies the traditional importance 
of the factor, suggesting a decline in urban-rural differences, a conclusion 
which can only be tentative, however, for traditional use of these variables 
involves parameters not available for the present study* In all investi- 
gations, this cluster of variables was found to be relatively Independent 
of the cluster of variables called "Socio-economic Status"* Unexpected 
loadings may be explained a distinction between classical definitions 
of urbanise and that used by the U*S* Census Bureau* 

VI* (4.0X of total variance) The negative loading of one primary 
variable in "Lianufacturing Concentration" suggests that goveznraent centers 
are not usually taanu factoring centers and vice versa* This suggestion 
is supported by findings of a prevlotis study* 
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VII. (3.9% of total variance) Another economic specialization 
factor, "Comnercial Center", reveals a limitation of factor analysis. 
CoiqMrisons of factors of this and previous studies indicate a divergence 
of results. Attention to detail, however, discloses obscured relation- 
ships which usually dissolve the divergence. 

VIII. (3.1% of total variance) 'Unemployment" does not occur as a 
factor in previous studies. The critical variable involved, however, 
does occur in these studies though associated with factors idiich do not 
appear in the present study. 

Hone of the remaining factors explains more than 2.9% of total 
variation. 

Comparisons of all the studies considered suggest that the primary 
dimensions of community laay be termed "socio-economic status", "residential 
mobility", 'Urbanism", and "non-white". Only two factors which occur in 
the present study, "government revenue and cbcpenditures" and "unemployment" 
do not occur in the previous studies considered. However, these factors 
appear to be the most dependent upon the selection of variables and unit 
of analysis. 

Discussion of Bonjean's presentation: If one is interested in find- 

ing independent dimensions, or in clustering out a large amount of data, 
it is advantageous to use the least number of concepts in an orthogonal 
analysis, i.e., uncorrelated factors. Perhaps, however, it would be 
better to start with the asfnjmption that there are independent uncorrelated 
factors, doing an oblique, and then inferring rather than assuming that 
these are consistent independent dimensions. 

The basis of the 79 variable selection was not theoretical, since a 
base for selection would draw one away from a pure factor analysis. It 
was supposed that if too many related variables were used, artificial 
factors might be created, factors which are a function of the data pulled 
in rather than of the underlying character of cities. The basis of 
selection from a previous study in which the criteria for selecting 

c<xnmunities included functional or economic specialization and size. Such 
selection is not an end in itself — no factor analysis is — but it may 
be useful in one type of description, even though it does not explain 
anything nor can predictions be made from it. 

There seems to be an assumption in this study that large and small 
counties vary in the same way, for unless one controls for size the other 
factors do not discriminate. 

The studies which have been made allow one to discover relationships 
among comaunities. Previous studies have provided clues and rudiments 
which may allow theory building. A problem involved with any type of 
social research, however, is drawing boundaries around the social unit 
concerned. At this time, such boundaries can only be proximate. 
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fla pping Data and Units (Sale* 1967) 

The problem at hand la that of the population of school districts. 

Such figures are collected on the basis of tounshlps rather than school 
districts 9 and the boundaries of townships and scho;l districts are allowed 
to differ In most states. The approach so far has been developed on the 
basis of geographic area, assuming an even distribution of population in 
townships. School district population Is to be determined by a sum of 
township populations lying in the district Including any townships which 
also fall within other districts. The population of these township 
fractions will be determined by the product of the percent of the area 
lying within the district in question and the total population of the 
township excluding any urban areas. 

Three rather flat and evenly populated areas of Iowa have been 
selected for pilot study. One problem with Iowa is the poor map coverage, 
including school district maps which vary in scale, accuracy and readability. 
Hence no accurate comparative measurements may be obtained from the maps 
themselves. 

An aim of the pilot study is to determine a method of partitioning 
^a t a by some kind of geographic division of the units concerned. One 
possibility is the transference of conclusions from data of one type of 
geographic area to another type of area. A research project in Texas 
instigated by Title I of the Elementary and Secondary Education Act 
revealed that the population in urban areas accounts for much of the 
school district population. 

There may be some difficulty in coordinating the two basic sources 
of information used in the project, namely the Iowa State Department of 
Education and the Bureau of the Census. There is the possibility of 
using the coordinate systems used by the federal census as a basis for 
analyzing data, but these coordinate systems are planned by the states 
and manifest sane variation although they are becexaing more popular for 
mapping purposes. An attempt has been made to combine school district, 
civil division, and county maps in order to identify school districts. 

The results were compared with larger scale maps provided by the counties 
in question. Estimates and measurements were made from both large and 
small maps. Considerable variation showed up in some cases. 

The possibilities of computerizing data from maps including school 
district boundaries must wait the technical achievement of transferring 
the unit of census school district into a computer. The census 

data are or will be already in c<nputerized form. The census plans for 
1970 include the capability of identifying an area in terras of blocto 
or block phases \hich when grouped and tabulated by computer will give 
information about school districts as well as other districts. The 
boundaries will be ordered on the basis of computer Information, but such 
information will apply only to urban areas. 
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A difficulty associated xd.th data other than total population of 
school districts nay be alleviated by taking the school district boundar- 
ies to the enumeration boundaries of the census bureau. It has been 
discovered that school district maps obtained froo local school super- 
intendents and the enuneration district maps of the Census Bureau may 
be successfully combined. The enumeration district » however, limits the 
selection of variables used, which is a problem only when selected 
variables are desired • The cost of obtaining e num e r ation district tapes 
is estimated roughly at $250,000, exclusive of mapping costs* 

There are a number of problems related to projects of this nature. 
There is, for instance, the question of whether to estimate area per- 
centages or to go to the trouble of measuring them. Also there is a 
problem of whether to use these rougihly constructed maps, or maps from 
the school superintendents* or state highway maps which plot dulling 
units. A problem with converting data to school units is the high 
flexibility of school unit boundaries. Another difficulty concerns the 
determination of high school attendance areas. In Florida, for example, 
this cannot be determined directly by school district population. In 
Iowa where this may be determined by school districts, the districts cut 
across township boundaries. Finally, with the county as the unit of 
analysis, the internal heterogeneity of the unit will be so great that 
adequate comparative analyses will be impossible unless either smaller 
units are considered or variables are selected which reflect the 
differential character of the units. 



Esti»^<^tion Techniques for Hierarchical and Disconsp nant Units 
(Wiley, 1967) 

The Sale presentation dealt with disconsonant units such as school 
districts and townships with incomparable boundaries and divergent sets 
of data. The present discussion will be concerned with hierarchical 
units* units of analysis which are completely contained in other units 
of analysis. The different levels of these hierarchies will yield data 
not available at other levels* hence there is a problem of comparability 
of units. The purpose at: hand will be the discussion of some possible 
solutions to this problem. 

The problems of hierarchical and disconsonant units affects the 
type of research which* on the basis of indirectly available data, attempts 
to determine variables to characterize a given unit of *«alysis; in 
particular, attempts to characterize sdiool districts on the basis of 
data available only at larger units. Consider the problem of coordinating 
two sets of data, one available only at the county level, the other only 
on the school district level. How could differentiating variables for 
school districts be determined from county data? 



One solution to the problem might be to do a regression analysis of 
the constant county values on the variable district values. Linear 
dependence may be corrected by mean deviating the predicted values from 
the predicted county mean and adding back actual county values. The now 
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llnesr Independent scores will still di££erentidte districts* A second 
solution, assuming no substantial bias associated with the groupings o£ 
districts within a county, would begin by tabulating county variables by 
cumulation o£ district characteristics* A regression equation then 
established on the county level could be used to predict county scores 
and then fed back to the district level to predict district scores* These 
predicted values could then be mean deviated and the actual values added 
back* One might also complete analysis of the covariational structure 
of a relatively heterogeneous area where complete data are available using 
random sampling* Equations established on the basis of this area could 
be applied to other areas* A third solution might be to subject raw 
county values to a composition program yielding scores on composite 
variables which will give some optimality characteristics to eventual 
composite variables* 

All of the solutions are inadequate to the extent that they overlook 
actual social class considerations* All, of course, are solutions only 
to problems of hierarchical units* Disconsonant units, usually lower 
level units, may be merged until hierarchical structure begins* 

Psuedo*hierarchies might be established by a random grouping of lesser 
units using variables which would preserve the structure of the lower 
level, yet by virtue of attenuation would allow generalization back down 
to the lower level* Non-random groupings may be used only if the created 
unity in itself is of interest for further study* If different theoretical 
variables are used at different levels and one then attempts to make 
comparisons at a low level of the hierarchy, the resulting mass of types 
of unit may be controlled to some extent due to the effects on lower level 
variation by the structure at higher levels* 

It is possible, finally, to achieve an orthogonal or uncorrelated 
classification system for the purpose of characterization at each level 
of a hierarchy by doing separate analyses on each set of data and then 
doing composites to characterize the units* 



Ir/inlif^attona of the Choi<^ of Units and Te chniques 
(Hasatir, 1967) 

Since projects similar to those presented today are likely to set 
precedents for the solution of school problems, it would be best to assess 
their proposed procedures for future effectiveness* 

There is, among others, a critical problem about the relevance of 
the data used in these projects to the results desired. The reason for 
this is an inclination to accept whatever data are available* Yet, the 
selection of data, to be significant, should have some theoretical basis; 
there should be some reason for supposing the data used are relevant to 
the outcomes desired. What is desired is an operative model of causation 
which would include the problems of feedback and the method* What is 
done, simple description and redescription, will not produce the variables 
adequate to such a model* 
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There Is s. tendency, compatiblw' v/ith the interests of soiae data 
gatherers, to over*eiaphaslze similarities and ignore important individual 
differences. Generally, data sources are published documentations, which 
set limits to the possible kinds of recombination. Flexibility, however, 
is essential to the eventual effectiveness of the system. School district 
situations are constantly changing, hence rendering data bank material 
obsolete. It might be better, for example, to substitute definitions in 
terms of geographic corrdinates for district names. Moreover, one must 
contend with the structure of variables with which data reporting systems 
are concerned. 

In general, though not in every case, the application of data per- 
taining to larger units of analysis, e.g. , the county, to analyses of 
smaller, less heterogenous, units is of questionable utility. It is 
not clear, furthermore, that the information provided by a high level 
institution such as the U.S. Office of Education is relevant to crucial 
problems at lower levels. The aim of this project, i.e., the discovery 
of the impact of the social characteristics upon educational outcome, 
will be frustrated so long as we continue to accept available documented 
data without a plausible theoretical basis for selection. 



Applications to the Stratification of Schools (Johnson, 1C 67) 

The Central Uidwestern ilegional Educational Laboratory (CEi'JREL) 
exists for the purpose of helping to bridge the gap between research and 
discovery and diffusion rnd improvement in the classroom in connection 
with a region which includes portions of Kentucky, Missouri, Tennessee 
and Illinois. The Laboratory's concern is to develop research designs 
especially applicable to educational research and development efforts 
occurring at the multi-state or regional level. Its primary aim is the 
developraent of a stratification of all the sub-units of regions in the^ 
fashion discussed in the Wiley presentation. The most promising solution 
at present involves a modification of the analytic procedures and sampling 
methodology originally developed by the Wisconsin group. The data con- 
sidered relevant include "factors" derived from all characteristics ^ich 
can describe units, and are available in part through the state depart- 
ments of education and on the national level. It is anticipated that 
regional stratification on the county level vrill allow stratification on 
the district and, eventually, school levels. The research designs 
developed will be applicable to field projects which will Implement 
curricula through pilot studies in selected schools. 

The laass of data, the large number of variables and indices, con- 
sidered relevant to the CEI'IREL project suggests that a theoretical basis 
of selection may be necessary. A counter-suggestion, however, is that 
the important practical aim of the project is to provide adequate guides 
at any moment for educational decisions, hence the justification of the 
stratification developed is not in question, and conceptualization, a 
long-tert; scientific process, may be kept at a minimum. The important 
point is that the data cannot be obsolete if they are to be useful. Dat; 
obsolescence night be alleviated, for instance, at the county level by 
introducing a constant of dating of Inforriation and stratification into 
the system. 
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Applications to Size a nd Hon-Dollar Costs of Secondary Schools 
(Wakefield, i:67) 



A specific probleu about the comparative costs of expanding an exist-* 
ing high school and building a new one to meet enrollment demands has led 
to the exploration of the general problem of the relation of quality to 
size in secondary schools. Quality might be determined, as it usually 
is, by economic factors, or by so-called non-dollar productivity, which 
may consist of a set of complex variables. 

The present investigation began with the ideal that, given a number 
of indicators and a notion of the desired productivity in a particular 
community, one might be able to determine the appropriate high school size 
under those circumstances. Changing situations and consequent productivity 
expectations could probably be treated as operational problems. One 
could, for instance, adjust structural flexibility to population flux 
prognoses. An unique aspect of the investigation has been a concern for 
composite variables, or qualities, rather that stratification. ^Hie project 
has been restricted to readily available data due partly to considerations 
of inmiediate utility and clientele expectations and partly to discover 
the utility of existing data in practical decision-making. Two diverse 
states with extensive available educational data were chosen for study 
in this first phase of the project. 

The methodology consists of the determination of variables, or 
principal component scores, by interrelational groupings of the original 
data, factorization of these scores by image analysis and rotations, 
regression analyses with the image factor scores, and, finally, graphical 
and tabular summarization of the results. 

An advantage of the arrangement of data used may well be the greater 
possibility of evaluating the reliability of composite variables than of 
simple variables. The aggregation of items demanded at most levels of 
the methodology xd.ll presuppose well-developed theoretical formulations 
or commitments which will be capable of testing* The decisions about the 
selection and naming of variables and factors xdiich determines the out- 
come of the striy must be done on an a priori basis. 
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CHAPTER 3 



PLANS AND METHODS OF INVESTIGATION 



In Chapter 1 it was stated that the ultimate aim of research of 
the type conducted in this report is to create an algorithm for 
designing schools. Inputs of the algorithm will be the quantitative 
values an administrator might choose for manipulation, such as the 
size of the building, and outputs will be the qualitative values of 
the dependent variables, such as pupil achievement. In essence, the 
algorithm will enable the educational planner to compute the effects 
of different combinations of Independent variables on the type and 
degree of transformation induced by his school -to-be. 

The school will thus be viewed as a transformer, accepting a 
variety of inputs and accomplishing various designable transformations 
before divulging its output. 

For any real transformer a tetrad of relevant information can be 
described: (1) input data, (2) data concerning the transformer itself, 

(3) output data, and (4) data relating the transformer to its environ- 
ment. The design of the transformer will consequently be dependent 
on amassing data which adequately describe the characteristics of the 
input and environment. The design also requires data describing the 
output characteristics, for the success of any proposed transformation, 
by definition, demands a comparison of output and input. 

A prerequisite for the study, then, was data which accurately 
characterized the input, output and environment. However, data alone 
will not establish the design algorithm; there must be an underlying 
theory and sat of definitions to meaningfully relate the data to 
categories for design. 

Establishing a theory and set of definitions concerning design 
problems had to be another goal of the research. 

The final goal was to generate a mathematical methodology for 
relating the data to the theory and definitions established. To 
test the methodology, two states with extensive educational data banks 
Iowa and Florida, were initially selected for examination. 

In Section A of this chapter the methodology is discussed more 
completely; in Section B a survey of various environmental facfors 
which summarize the educational climate extant in the 48 continental 
states Is presented; in Section C the types of input and output data 
available in 24 states are indicated; in Section D the problems 
encountered in trying to order ttie Florida data are explained. 



i.. Sequence of Operations 



Three phases of sli.tultaneous research were undertaken. Phases 1 
and 2 concerned attempts to related existing input, output, and 
environment data to a cornmon unit. This process was basically the 
same as the couanon-denominator method for relating different fractions 
to the same base. For this research, the common denominator for all 
data was defined as the school. The methodology and plans for investi- 
gation are summarized in Figure 3-A-l. 

The major sources of the data needed were the United States Census 
Bureau and the state departments of education. The source data requ'*red 
special preparation for analysis; much organization and interpretation 
had to be made before the structure and substance of the content of 
the census and educational data banks was apparent. The data stored 
in the educational data banks were sifted and then indexed in order to 
facilitate the definitional clustering which was part of the methodology. 

The census data had to be related to school districts; the Census 
Bureau divides communities into sections, the boundaries of which are 
seldom coterminous with school districts. All data, however, had to^ 
be related to the school. Therefore, a method was devised to determine 
the unit-by-unit correspondences betv7een census and school boundaries . 

The third initial phase of project activities involved organizing 
and indexing relevant results of relevant previous investigations; the 
concerns of preparing the theoretical framework involved the cream of 
several different scientific domains. Much of the inforuiation 
gathered for this effort has been summarized and presented in Chapter 2. 



3. Survey of Forty-Eight States. Educational Climate 



As explained in the introduction to this chapter, Iowa and Florida 
were selected as the states on which the developed methodology was to 
be tested. These states were selected because it was believed their 
educational data banks would allow the most input, output, and environ- 
mental data to be examined* However, a discussion limited to just Iowa 
a pH Florida might also be limited in application. To determine some 
of the general aspects of the variation in educational climate in more 
states a survey of the forty-eight contiS'^'^® states was made* 

The presentation of the survey \7hich fellows is divided into 
four parts. First, nine theoretically defined constructs hypothesized 
to be related to educational climate are presented and notes are made 
on how they would ideally be measured. Second, a clustering of avail- 
able demongraphic and school data into categories corresponding to the 
constructs is presented. For each cluster, a composite score was 
produced for each state; the empirical analysis into composites is 
shown to lead in some cases to revision of the construct definitions. 
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Phase 1 


Phase 2 


Phase 3 


Acquiring and analyxlng 
the contents of the 
data bank descriptions 
and selection of the 
. data to order. 




studying and testing 
of various schemes 
for reconciling 
hlerarchlal and 
disconsonont units. 




Surveying and 
annotating and 
Indexing the relevant 
literature . 








Obtaining and reorgan- 
izing the data bank 
files and production 
of the flc.al Indexes 
of data Items. ^ 




Determining the unit* 
by-unlt correspondences 
between census and 
school boiukdarles* 




Synthesis of a theory 
of the relatlonrhlps 
between school size 
and productivity* 


% 







Transforming all data 
Into a common (school) 
set of units* 



V 



Compositing the Input 
data according to the 
theoretical categories. 



Regression across 
the Image factors* 



Determination of the 
adequacy of the 
available data for 
measuring according to 
the defined theory* 



I 




Figure 3--A-i Sequence of project activities 
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Third, the tzztor structure the iooposite scores is presented anc? 
interpreted. Fourth, the factor scores derived frc-i the 
structure are presented in various forms which affect stratifications 
of the forty-eight states. These stratifications are used to provide 
a context for the secondary school qualities of Ioi«« (For 
of completeness, the context of Florida is also considered, altho^, 
as explained in Section D below,the Florida data was not analyzed 
extensively* 



Definit ions a nd Ideal Indicators of the Constructs 

nine constructs were defined, partly logically and partly by 
consideration of possible data sources. Siturtion, Heeds and Innova 
tion, and iiotivation and Ability were the three superconstructs from 
which the nine constructs were developed. 



Educational Situations of a state refers to the educational process 
as it presently operates in the state* Ideally, one would consider 
measures of pupil input such as family background, natural ability 
(IQ, etc.) and elementary school achievecient. In addition, corner ^sons 
of high school curricula and measures of pupil output would be 
able. Ho data, however, were available in comparable form for all » 
states and four saore prosaic measures were used— teacher quality, 
school size, state control, and funding* Teacher Quality ref era to 
the ability, training and performance of teachers. A measure of teach- 
ing performance such as correlation between pupil ability and 
achievement would be best, but this is not available* A second desirable 
measure would be a comparison of teacher training such as Ac numoer of 
university credits per teacher. However, this is not obtainable due 
to the different criteria for teacher accreditation used by each state. 
School Size refers not only to the number of pupils, but also to measures 
of departmentalization and specialization* The number of secondary 
school departments and Ae number of secondary school courses offered 
are examples* However, comparative data on these measi^es has not 
been collected. State Control refers to state regulation of textbooks, 
courses, and teacher qualifications* The number of high school courses 
required by Ae state and the number of standardized tests employe 
may represent measures of state regulsfio®* However, there are few 
such data comparing regular secondary school curricula* Funding refers 
to the degree of local, state, and federal support for education. A 
.Measure of solvency suA as the school district indebtedness ratxo 
is desirAle. However, the best that can usually be obtained is a break- 
down of educational funds into federal, state, and local sources. 

Meeds and Innovation refers to inferences about the present and 
future problems of the school system. The expansion and shift of 
population in a quantitative sense leads to crowding certain areas aad 
relative underpopulation in others. This change involves the destruc- 
tion of former secondary schools and the building of new ones which 
are generally larger* During the period of transition, overcrowded and 
undercrowded schools exist. The qualitative shift of population to 
urban areas and especially of non-wnites to Northern urban areas has 
more than added to the school population; it has added problems of a 
lower income group from families of a lower educational level* Th*» 
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opposite situation is the continued existence of secondary schools 
in population*depleted rural areas with attendance difficulties of 
maintaining a varied curriculum and a good teaching staff, ^io con- 
structs are defined. Quantitative Population Change refers to popula- 
tion eiqiansion in numerical terms; one is interested in ascertaini^ 
the potential number of secondary school pupils. 9*«*^^*^**^^^^ Population 
Change refers to the changing composition of the population in terms 
of racial, urban-rural, and occupational differences. It is best 
indicated by measuring socio-economic and urban-rural heterogeneity. 

Motivatiwi and Ability refers to the desire and ability of the 
population of the state to improve education. Motivation is best 
reflected by the comnunity's interest and participation in educational 
affairs. Support of school bond referenda, percent voting in school 
board elections, and PIA membership are all examples of c ommu n ity 
involvement. Houever, we are not able to obtain this Information 
uniformly and must use two less desirable indicators of motivation: 
Social Resnonsibilitv and Educational Attainment . fr>cial v^« pftnaibilitY 
refers to the proven interest in group action as reflected by voting, 
contributions to charity, amount of unionization, and degree of 
insurance coverage. Educational Attainment refers to the educational 
level of the community and its reflection in types of occupation. 

Ability to Bay refers to both wealth and other sources such as non- 
public schools that are used to defray public educational expenditure. 



The constructs are summarized below: 



Construct 



1. Teacher Quality 



2. School Size 



Definition 

The degree of ability^ 
training and perfor- 
mance of the teaching 
staff. 



The size, departmental- 
ization and specializa- 
tion of secondary 
schools. 



Ideal Indicators 

1. Correlation between 
student adiievesient 
and pupil ability. 

2. Number of university 
credits per teacher. 

1. Number of extra- 
curricular activities. 

2. Number of courses 
offered per secondary 
school. 



3. State Control 



Control by state of 
various aspects of 
education. 



1. Degree of state con- 
trol over teacher 
accreditation. 

2. Amount of state con- 
trol over educational 
curriculim. 



o 



- 37 - 



Construct 
4. Fundins 



5* Quantitative 

Population Change 



Qualitative Popu- 
lation Change 



7* Social Respon- 
sibility 



S« Educational 
AttalnTiCnt 



9* Ability to Pay 



definition 

The aoney spent on edu- 
cation as derived froii: 
state, local, and federal 
sources. 



The change In popula- 
tion through birth and 
migration and Its effect 
on public secondary 
school enrollment. 



The changing coo^K>s5.tlon 
of the population In 
terms of racial, urban- 
rural, and occupational 
differences. 



Interest and Involve- 
ment in civic and 
comaunlty affairs. 



The educational level 
of the population. 



The potential monetary 
resources available to 
the cocoDunlty for 
education. 



Ideal iiiuicatora 

1. Indebtedness ratio 
(ratio of school debt 
per school district 
valuation) 

2. Per-pupil expenditure 
from federal sources. 

3. Per*pttpil expenditure.^ 
from state sources 

1. Potential number of 
secondary school 
pupils . 

2. Percent of over- 
crowded schools. > 

1. Socio-economic 
heterogeneity of 
potential secondary 
school enrollment. 

2. Urban^rural hetero- 
geneity of potential 
secondary school 

. enrollment. 

1. Percent voting for 
school board memberG. 

2. Percent voting for 
school board refer- 
endum. 

3. Percent belonging to 
PTA. 

1. Median yrs. of 
education. 

2. Status Inconsistency. 

1. Wealth of state 

2. Non-public sources of 
educational funds. 



Indication of the Constructs 



Direct laeasurecient and» in fact » id eel indication fiere not possiole 
because of lack of data. Instead* several indicators were gathered ror 
each construct and a composite score was derived by the Esetbod of 
principal cooponents. Data from the period 1S58-62 were found to be 
most readily available and were, therefore, preferred. The titles of 
the Indicators are given in Table 3-B-l grouped by construct and with 
principal component loading. The principal component analyses were 
performed within clusters. In some cases the principal components analyses 
suggested a revision of the definition of the construct. In the paragrap s 
which follow, the indicator composition of each cluster is explained and 
rationalized and any revision of constructs is described. 

Six indicators were gathered for the construct. Teacher Quali ^. 

These were average teacher's salary, proportion of male teachers, pupil - 
teacher ratio, supervisors and department heads per pupil, guidance ^ 
counselors per pupil, and librarians per nijpil. The average teacher 
salary might be regarded as an indirect iiiicator of teacher cjiiality since 
higher pay should attract better qualified teachers. However, salaries 
are to a large part determined by factors unrelated to teacher cuality. 

The proportion of male teachers may, likewise, be related to teacher 
quality. Hale teachers tend to be younger, are more apt to seek h g er 
degrees •‘and increased specialization, and are more involved in extra- 
curricular activities, especially in athletics. The pupil-teacher ratio 
might be expected to correlate highly with teacher quality since SMller 
classes are felt by many persons to be more effective places in which 
to teach. The number of supervisors and department heads per pupil, 
number of guidance counselors per pupil and the number of librarians per 
pupil are indicative of departmentalization and a more specialized teach- 
ing staff. The construct of high teacher quality is positively correlate 
with average teacher's salary, percent of male teachers, nuiAer of depart- 
ment heads and supervisors per pupil, and nundier of guidance couns^ors 
per pupil. It is negatively correlated with pupil-teacher ratio, which 
means that a low or favorable pupil-teacher ratio is positively correlated 
with teacher quality. It is also negatively correlated with the number 
of librarians per pupil. The variation in state accreditation of secondary 
school librarians and the existence of substandard librarians in many 
states might account for this negative correlation. 

For the second construct. School Size , five indicators were chosen. 

Uiey include the percent enrollment in small secondary schools, the per- 
_cent-enrollment in medium-sized secondary schools, the change in t e 
number of secondary schools, the number of additional classrooms nee e , 
and the number of excess secondary school pupils. The enrollments in 
small end in medium-sized secondary schools are a reflection of specia iza- 
tion and departmentalization since larger schools tend to be more specialised 
and offer c wider range of courses and a greater number of extra-curricular 
activities. The change in the number of secondary schools is an indication 
not merely of construction, but also of consolidation. Secondary schools 
recently built tend to be larger vdiile older schools which are bein^ 
torn doxm are mainly smaller ones. The last two indicators, the number 
of additional rooms naadad and tha nutibar of excass sacondary school 
pupils, are ’ooth indicators of crowJing. Crowding often creates school 



TABLE 3-B-l 



CONSTRUa INDICATORS AND PRINCIPAL 
COMPONENT LOADINGS 



1 • Teacher Quality 

+84 Average secondary school teacher's 
salary 

+62 Percent of male teachers 

+76 Supervisors per pupil 

+84 Guidance counselors per pupil 

-42 Pupil -teacher ratio 

-M Librarians per secondary school pupil 

f 

2. School Size 

-90 Percent enrollment in small secondary 
schools 

+89 Percent enrollment in medium-sized 
secondary schools 

+68 Change in number of secondary schools 

-11 Additional classrooms needed 

+03 Excess public secondary school pupils 



3. State Control 

+62 Number of state-required courses 
+79 Number of state-recommended courses 
+82 Number of state-guided courses 



Quattititative Populatio n Change 

+84 Population increase ( 

+82 Population mobility 
+70 Birthrate 

-22 Population under 15 yeors 

+85 Change in public secondary school pupils 

+48 Percent of population under 15 years old 



Qualitat ive Populat-lnn Chance _ 

+39 Net migration white population 
+81 Net migration non^nite population 
-62 Percent Negro 
-51 Change in urbanization 
+80 Percent urban 

-08 Percent engaged in manufacturii^ 

-09 Percent of technical and professional 
workers 

Social Responsibility 
+64 Percent voting 

+89 Percent covered by hospital insurance 
+73 Expenditure for parks 
+78 Amount raised by community chest 
+87 Membership in AFL-CIO 



4, Funding 



-42 

-70 

83 

08 

-45 

93 



Percent state funds 
Percent federal funds 
Per capita state educational expendi- 
ture 

Percent spent on education 
Percent spent on public welfare 
State educational expenditure per 
pupil 



8. Educational Attainment 



+89 Percent with 4 years of high school 
+82 Percent with 4 years of college 
+90 Median school years 
-69 Percent illiterate 
+71 Number of doctors 
+77 Number of psychologists 
+85 Number of technical and professiorxil 
workers 

459 High school graduates 



9. Ability to Fav 

+92 Per capita income 

+24 Change in per capita income 

-93 Poverty 

+71 Percent in non-public secondary school 
-08 Change in non-public secondary school 
enrollment 
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.hifts and adversely effects s^y of the ben^^« ’ 

ertra-curriculet activities. The .“e u 

the indicator Intercorrelations is larger ^ the 

positively with the percent enrollment in m^iw negatively 

change in the number of public secondary “ i^ooU. It Is 

collated with percent enrollment in •<«“ 'tterffow. the 

not related to either of the crowding *^'f‘eiu“es 

construct is that of size rather than utilisation of faciiit 

The third construct. State Con^ . »nif,:4*,^^"'c^rre‘s.'‘ 
number of state-required courses, „„ comparative data 

and number of state-guided courses. secondary school 

are available on state control wr the ^ courses such 

curricula. The indicators are for sta e innovated as of December, 

as driver education and conservation J reliance 

1555. Ho new data were available on a ““^“at^ united the value 
on old data of a more restricted “t“te would hw indicators these 

of these indicators. In the “>>»““• hi^ Lid tL construct 

are used. The indicator inter correlations sre hign ana 

measures state control over education. 

The fourth construct, Fundi^. consists of ste^ i ^l^f®"^^ P 

state funds, percent federal ^™<’®'^:“^tture, percent sp«t on 
expenditure, per-pupil state educati^l e^it«e, ^ ^ 

education, and percent spent on p^Uc sLiunt spent 

per-pupil state educati^l ®f ®^“"”'1ir«r!lpu^triM“ educational 
by each state on education. In addxtion, the p p p . 

expenditure when compared to the ?? education. The percent of 

consideration the burden borne by non-^ ,„n.ces is related to solvency, 
educational funds from federal a^ sMte indicative of a lack 

Greater federal aid is associated with ^d ^ w ^ education 

of resources from state and local sources, tte ^rcent 

and the percent spent on public "* ”«te^fLids into education. 

Willingness of a poorer state “ correlates 

The construct turns «r-MPltt st^ educational expenditure 

positively with per-pupil and ^r MplM sc 

and nerati/ely with percent federal f^s ^ educational 

percent state funds. This indicates that inmates irtthh^ne 

expenditure a large part of the fun s ducational expenditure and the 
is’^no correlation between the ®»“f f ^Zh negative 

percent of state funds spent on education and there xs a 

correlation with the percent spent on public welfare, 
nwr. M. #« Indies. 

measuring population increase. 

There are seven indoors of 

include net migration of w^te an -greent engaged in manufacturing, 

oercent urban, change in urbenizati-on, p 



and percent who are technical and professional wrkers* The net laigra* 
tion of white and non-white population indicates a differential popula- 
tion change which is reflected in different educational needs. This is 
likewise true of the percent urban and change in urbanization. The 
percent Negro, percent engaged in aanufacturing and percent who are pro- 
fessional and technical workers reflect the heterogeneity of the 
population. The construct correlates positively with net aigration of 
non-white population, percent urban, and to a lesser extent net migration 
of white population. It is negatively correlated with percent Negro and 
change in urbanization. The revised construct is, therefore, found to 
be a measure of urbanization. 

The construct. Social Responsibility , consists of five indicators 
percent voting, percent covered by hospital insurance, expenditure for 
parks, amount raised by coiociunity chest, and membership in APL-CIO. A 
more civic— oriented and socially responsible community as measured by 
the above indicators would be more likely to show interest in its edu- 
cational system. The indicator intercorreiations for this construct 
indicate a high positive correlation with all of the above indicators. 



The construct, Bducational Attainment , consists of eight indicators 
divided among educational level and occupational level of the population. 
These include percent with four years of high school, percent with four 
years of college, median school years, percent illiterate, number of 
doctors, number of psychologists, percent of professional and technical 
workers, and number of high school graduates. Educational Attainment 
may reflect conanunity interest and involvement in educational affairs 
since a more highly educated and literate population will more likely 
strive for better education. The indicators have high intercorreiations 
(positive, except for percent illiterate) and the construct measures 
attainment. 

The final construct. Ability to Pay , consists of five indicators: 
per-caplta income, change in per-capita income, poverty, percent attending 
non-public secondary schools, and change in non-public school attendance. 
The pc.r-capita income, change in per-capita income, and poverty indicate 
the wealth of the community which imposes limits on the funds available 
for education. The percent in non-public secondary schools and change 
in non-public secondary school enrollment reflect a source which takes 
away some of the burden of public education. The indicator inter- 
correiations are highly positive for per-capita income and percent enroll- 
ment in non-public schools, and highly negative for poverty. The construct 
measures the ability to the state to pay for its educational system. 



Factor Structure of the Composite Scores 

The composite scores for the clusters are to be considered more 
fundamental than the indicators on which they are based. But they are 
considered to vary in a multivariate interrelationship. Therefore, a 
principal component analysis vas performed on the nine composite scores* 
The intercorreiations. principal component loadings, and varima.<c loadings 
are presented in Table 3-D' 2. To determine the varimax loadings, the 




42 - 



TABLE 3-B-2 

INTERCORRELATIONS AND FACTOR STRUaURE 
OF THE COMPOSITES 

Correlations 





1 


2 


3 


4 


5 


6 


7 


8 


9 




1 


100 


60 


02 


+78 


04 


78 


68 


77 


91 


i 


2 


60 


100 


05 


+45 


24 


61 


50 


63 


64 


1 


3 


02 


05 


100 


+15 


-01 


11 


27 


18 


16 




4 


-78 


-45 


-15 


100 


-04 


-62 


-74 


-78 


-81 




5 


04 


24 


-01 


+04 


100 


01 


-30 


21 


04 




6 


78 


61 


11 


+62 


01 


100 


65 


69 


80 




7 


68 


50 


27 


+74 


-30 


65 


100 


62 


78 




8 


77 


63 


18 


+78 


21 


69 


62 


100 


81 




9 


91 


64 


16 


+81 


04 


80 


78 


81 


100 












Principal 




















Components 


Rotated Components 






1. 


Teacher Quality 






+92 


93 


-01 


-07 






2. 


School Size 






+72 


73 


31 


-04 






3. 


State Control 






+19 


07 


-00 


99 






4. 


Funding 








87 


+86 


-05 


+13 






5. 


Quantitative Population Change 


+06 


04 


97 


01 






6. 


Qualitative Population Change 




+85 


86 


-01 


01 






7. 


Social Responsibility 






+82 


81 


-39 


24 






8. 


Educational Attainment 




+88 


87 


21 


14 






9. 


Ability to Pay 






+96 


95 


-01 


09 
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three unrotated couponents corresponding to latent roots of the corre* 
latlon matrix i^ich were greater than or equal to 1.0 were rotated. 

The principal component and the three rotated components are described 
in the following two paragraphs. 

The principal component, as noted on Table 3-B-2, correlates 
positively with all but composites 3 and 5. There is a high positive 
correlation with Teacher Quality . School Size . Fund ing 
Population Change or Urbanization . Social Responsibility , Educational 
Attainment , and Ability to Pay . A high score would seem to imply a high 
quality of secondary education both in terms of ability and of performance. 
There was no correlation between the principal component and the con« 
structs of State Control and Quantitative Population Change . 

The first rotated component is essentially the same as the principal 
component, and might be summarily termed Quality of Secondary Ed ucation. 

The second unrotated component consists essentially of composite 5, 
Quantitative Population Change or Growth . A high score on the factor 
should imply a great increase in population and a consequent need for 
change in the secondary school system* The third rotated coaq>onent 
consists essentially of composite 3, State Control . As mentioned above, 
there were no data available on a comparative basis for regular secon- 
dary school curricula* The data used were old and were related to 
special recently-introduced courses such as conservation, atomic energy, 
and driver's education* As a result, this third component was not used 
since it did not give a complete and accurate picture of state control 
over education* 



Stratification in the Context of Iowa and Florida 

The scores for each state corresponding to the principal component 
and to the three rotated components were computed. These scores provide 
a means of stratifying the 48 states. The first stratification is non- 
dimensional and consists of the ordered list of principal component 
scores* A diagram containing the ordering is presented in Figure 3-B-l. 
The divisions between north and south and between urban and rural with 
regard to educational quality are noteworthy, lloreover, the highest 
educational quality is found in big-city states from the urban north, 
followed by the urban midwest and west, the rural midwest and west 
(including the rural northeastern states of liaine and Vermont), and 
finally the predominantly rural south and border states. Of the ten 
states with the lowest educational quality, eight are southern and two 
are border states* 

As previously explained, the third rotated component was difficult 
to interpret. The data on which it was based were older and of a special 
nature. However, the second factor is clearly interpretable as coming 
needs' and a bivariate stratification with rotated factor 1 provides a 
more detailed stratification of the states. This is presented in 
Figure 3-B-2. The second factor. Population Groxyth . interpreted in our 
context as "coming needs" does not fit into any*, regional or urban-rural 
pattern as does the first factor Educatiofial Quality . Examination of 
the graph in Figure 3-B-2 reveals that one quadrant is virtually empty. 
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Practically no states which have had a rapid population growth also 
have very poor secondary school systeos* Growing areas almost all have 
average or better education in terms of the constructs representing 
ability and performance • Iowa and Florida are about in the middle and 
are together in the univariate stratification on general quality 
(Figure 3-B-l). Iowa's position with respect to Educationa l Quality is 
similar to other rural midwestem states* while Florida shows a better 
quality of education than other southern states. With regard to 
Population Growth and "coming needs" the two states are far apart (Figure 
3-B-2). Again Iowa's position is similar to other states of the rural 
midwest ^ile Florida's phenomenal growth is much greater than that of 
any other southern state. W.th regard to the nation as a whole, Florida 
and Iowa tend toward the middle with respect to Educatio nal Quality. In 
regard to "coming needs" those of Florida well exceed those of the nation 
as a whole while Iowa's are substantially below the median. 

This analysis suggests that Iowa and Florida do not represent a 
wide range of educational quality; they do represent states with a great 
difference in population growth and, therefore, different emerging needs 
and problems. This condition relates directly to the problem of high 
school building and size. In Florida where a large building program 
is in progress, the optimum size of a high school is of crucial importance. 
In Iowa tha problem is one of detaining good education in areas of popula- 
tion stability. The methodology, developed in this project is intended to 
shed lip^t on how both problems can be handled. 

C. Automation of Educational Data 



The educational data— -i.e. , the data concerning pupils, teachers, 
schools, and school districts— were to be obtained for this research 
from the state departments of education in Iowa and Florida. The project 
was restricted to the use of data already available in oiachine-readable 
form and was, in part, a test of the utility of such data in providing 
educationally meaningful research results. An evaluation of the utility 
of the data obtained from the Iowa Department of Public InstructicMi is 
presented in Chapter 5. A parallel line of interest concerns the 
degree to which such data are available throughout the nation. It is 
important to the eventual extension of the present research to ask 
where else such data are available. Two presentations arc made in this 
section which are Intended to answer that question. 

To encourage states to collect and store educational data. Congress 
included Jn the National Defense E d ucation Act (NDEA) of 1958 a section 
providing funds to state departments of edr cation for Imprcving their 
statistical services (Section 1009, Title X). U^n approval of a 
proposal for collecting such data, a state could receive u? to $50,000 
in each fiscal year, provided the federal funds were matched. Figure 3-C-i 
is found in ‘*State Plans for Improving Statistical Services," U.S. Depart- 
ment of Health, Education and Welfare, OE-20028, Bulletin 1961, No. 15. 

It indicates th 3 dollars allocated for educational automation in fiscal 
year 1960 and thus the relative degrees to which the state departments 
of education were at that time moving toward the storage of educational 
data. States which are high on the list may be expected to have rather 
complete data. Iowa and Florida are high on the list and this corresponds 
to the fact that they both now have extensive data banks. 
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To ob*:ain more precise and up-to-date information on the availability 
of educational data, a telephone survey of 22 states was made in August, 
1967. The questionnaire is shown as Figure 3-C-2. A call tjas made to 
the director of data processing in each state or to an individ^l possess- 
ing a similar title. He was requested to indicate which secondary edu- 
cation data has been stored in a central facility. He was also asked 
whether the data were stored in machine-readable form— i.e., on punch 
cards or magnetic computer tape. Items of the data in the survey 
include typical information about pupils, teachers, schools and school 

districts. 

A tabulation of the results of the survey is presented in Table 
3-C-l. The tabulation includes information about Iowa and Florida; 
although they were not included in the telephone survey, the inforMtion 
about them was known from earlier project work. The 22 states in e 
telephone survey were selected at random from the A8 contiiuous uni tea 
States minus Iowa and Florida. The figures in Table 3-C-l should there- 
fore be a good representation of the status of educational automat on 
across the entire United States. A preliminary observation is that 
when data are stored, they are usually stored in machine-readable form. 



A summary of the results appears as Table 3-C-2. This summa^ 
gives for each data item the number of states that have complete data 
in machine-readable form. Such data could be input to the kinds o 
analysis used in this research. It is clear from the table that many 
states have detailed information on teachers and many have enrollment 
and financial figures for schools or school districts. Several states 
have curricular information on schools, (kily one state (Iowa) s 
detailed information on individual pupils. This progression is 
predictable on the basis of the purposes for which the state data banks 
have been established; namely, the state departments have established 
data banks in the course of ordinary data processing needs. Since 
state departments are usually responsible for teacher certification, 
they need detailed teacher information; since they are res^nsible tor 
accreditation of schools and for distribution of state fun 
enrollment, financial, and perhaps curricular information. With the 
exception of statewide pupil testing programs, state depart^nts are 
usually not responsible for processing information about individual 
pupils, so they have had no occasion to include such information 
their data banks. Moreover, in many states the compilation of 
pupil records at the stare level is viewed with considerable apprehension. 

In suranary, although data banks are being built, the infection now 
stored by the 24 states which were surveyed (except for Iowa) is not 
sufficient for the analysis used in this research, since pupil p^- 
ductivity is a major dependent variable complex and requires e 
pupil information. Furthermore, it seems unlikely that pup ^ 
soon be available in many states. That development 

service involvement by the states in the local, processes of scheduling, 
maintaining transcripts, and reporting grades. 
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Survey of Twenty-four States: Education Automation 



1. The University of Wisconsin is engaged in a detailed analysis of secondary school size in 
the state of Iowa. In order to understand the context of the Iowa data, we are surveying 
other states about how much secondary education wjta is stored in a central location and 
about whether data are stored in machine-readable form. 

2. This is merely a survey: no request for access to the data is anticipated. \ 

3. Here is a list of secondary school data items which are stored by some states. We should 
like to know whether your state does store these data (by school) in o central location. 

If the answer is yes, we should like to know whether the data are stored for all public 
secondary schools (or teachers, or pupils), or for just some. We should also like to know 
whether the item is stored in machine-readable form — cards or magnetic computer tape. 

/ 

4. Here is the list: 

STORED MACHl NE-READABLE 



a. Pupil grades 

b. Pupil test scores 

c. Pupils* family 

backgrounds 

d. Pupils' plans and 

aspirations 

e. Teacher salaries 

f. Teachers' education 

and background 

g. Teacher years of 

experience 

h . Teacher assignments 

i. Curriculum offerings 
{ . Number of pupils 

k. Number of teachers 

l. Expenditures 



not at 
all 


for 

some 


for 

all 






• 
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Figure 3-C-2 

Questionnaire on Education Automation 
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TABLE 3-C-l 



RESULTS OF TWENTY-FOUR STATE SURVEY, INCLUDING 
IOWA AND FLORIDA, AUGUST, 1967 



Pupil grccl-is 

i 

Pupi I test scores 

Pupils' family 

Pupils' plans 
and aspirations 

Teacher salaries 

Teachers' education 
and background 

Teacher years of 
experience 

Teacher assignments 

Curriculum offerings 

Number of pupils 

Number of teachers 

Expenditures 



Stored 



for 

all 


for 

some 


not at 
all 


1 


2 


21 


2 


8 


14 


1 


4 


19 


1 


4 


19 


23 


0 


1 


18 


3 


3 


18 


1 


5 


17 


2 


5 


8 


2 


14 


20 


1 


3 


21 


2 


1 


20 


1 


3 



Machine-Readable 



Yes 


No 


1 

2 


1 * 


7 


3 


4 


1 


4 


1 


22 


1 


20 


1 


18 


1 


18 


1 


8 


2 


19 


2 


22 


1 


15 


6 
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TABLE 3-C-2 



SUMMARY OF TWENTY-FOUR STATES HAVING COMPLETE DATA 
IN MACHINE-READABLE FORM, AUGUST, 1967 



Category of Educational Data 

Teacher Salary 
Number of Teachers 
Number of Students 
Teacher Educational Background 
Teacher Experience 
Teacher Assignments 
Educational Expenditure 
Curriculum Offering 

Pupil Test Scores 
Pupil Grades 
Pupil Family Backgrounds 
Pupil Plans and Aspirations 



Number of States Having Com- 
plete Data in Machine-Readable 
form. 



22 

20 

19 

18 

18 

17 

16 

8 

2 

1 

1 

1 
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D. The Data in Florida 



The Florida State Department of Education has for the last several 
years been preparing an information system for education. A state-wide 
compilation of educational statistics has been made and, as in Iowa, 
most of the data have been stored in machi.ie*»readable form. As mentioned 
earlier, it was precisely the extensiveness or data stor^.^e that led to 
the selection of Iowa and Florida as the locales for this research. 

They also seemed to provide contrasting demographic characteristics and 
educational administrative structures. 

The Florida information system has been especially directed at 
teacher and school certification and accreditation. Therefore, the data 
bank contains extensive information on teacher background and assign- 
ments, on school district enrollment and finance, and on school curriculum. 
In the course of this research, however, it was learned that Florida data 
do not include information about individual pupils. Specifically, of 
the 361 secondary schools, 43 have pupil data included in the data bank. 
These 43 schools participate in the System for Processing Educational 
Data Electronically (SPEDE) which enables school schedules, pupil grades, 
and transcripts to be processed and stored in a central facility. For 
the SPEDE schools, the pupil data are as extensive as in Iowa, but for 
the non-SPEDE schools, there are no individual pupil data available. 

The lack of complete pupil data in Florida necessitated a halt in 
the Florida analysis, while the original plans and methods of investi- 
gation continued to be applied to Iowa. It was considered of interest, 
however, to c<x.ipare the general characteristics of the 43 SPEDE schools 
with the 318 other schools. The results of this coniparison established 
that the SPEDE schools could not be regarded as a representative sample 
of the population and, consequently, that analysis of them could not 
have been generalized to the entire state. 

In Table 3-D-l a list is given of 32 variables identified in the 
67 counties of Florida. The variables were ejritracted from the U.S. 

Censu;' bureau City/Cctinty Data Buck (1962). The 32 x 32 correlation 
matrix of the variables was computed on the basis of the 67 counties. 

It was factored according to Harris’s (1962) modification of Guttman a ^ 
(1953) image analysis, and the factors were rotated according to Kaiser s 
(1958) normal varimas: orthogonal procedure. The loadings on the five 
rotated factors \diich accounted for the most variance appear to the 
right of the variables in Table 3-D-l. 

The first factor is readily identifiable as Urbanization ; the urban 
counties in Florida are the wealthy, recently-developed areas. The second 
factor correlates \d.th per-capita and per-pupil local school expenditures, 
and so is called Educ at Iona L Ex pen d i tu r e . Tne third factor is called 
Youth since it correlates positively X'/ith the birth rate and negatively ^ 
with median age. The fourth factor. T rade Growth , indexes sales and trade 
expansion. The fifth factor. Public Weal^ , correlates x/ith wholesome 
financial circumstances. 



TABLE 3-D-l 



FACTOR /MATRIX FOR FLORIDA DEMOGRAPHIC VARIABLES 



1. 


perceni' urban residence 


1 

58 


11 

* 


111 

* 


IV 

-30 


V 

-25 


2. 


percent employed in agriculture 


-47 


* 


* 


21 


* 

* < 


3. 


percent in one -unit hour is 


-30 


20 


* 


30 


4. 


median population age 


29 


* 


-89 


* 


* 


5. 


median school year completed (adults) 


86 


* 


* 


-22 


* 


6. 


percent employed in white-colar occupation 


77 


* 


* 


-21 


* 


7. 


percent in labor force (adults) 


* 


ie 


* 


* 


ie 


8. 


percent voted in 1960 election 


* 


ie 


-27 


* 


ie 


9. 


percent of vote for leading party 


-47 


ie 


* 


35 


46 


10. 


percent of vote for Democrats 


-64 


ie 


45 


* 


27 


11. 


percent non-white 


-45 


ie 


21 


* 


* 


12. 


percent sound structures 


86 


ie 


-24 


* 


-20 


13. 


percent overcrowded houses 


-48 


ie 


54 


* 


* 


14. 


population change 1950-1960 


85 


ie 


* 


* 


* 


15. 


popuiotion density 


26 


ie 


-23 


* 


* 


16. 


in-migration rate 


89 


ie 


T/e 


* 


* 


17. 


birth rate 


21 


ie 


87 


* 


* 


18. 


per-pupil educational expenditure 


ic 


96 


* 


ie 


* 


19. 


per-capita educational expenditure 


ie 


97 


* 


ie 


* 


20. 


per-pupil proterty tax revenue 


-25 


22 


ie 


ie 


65 


21. 


per-capita total local expenditure 


ie 


66 


ie 


ie 


30 


22. 


expenditure-revenue ratio 


ie 


ie 


ie 


ie 


* 


23. 


retaii sales volume 


CO 

CM 

1 


ie 


ie 


90 


* 


24. 


revenue -indebtedness ratio 


ie 


ie 


ie 


21 


83 


25. 


percent preschool population 


ie 


ie 


94 


* 


* 


26. 


percent recent move to new home 


89 


ie 


ie 


ie 


ie 


27. 


percent elementary school population 


-61 


ie 


61 


ie 


ie 


ro 

CD 

• 


marriage rate 


ie 


ie 


ie 


23 


ie 


29. 


median family income 


81 


ie 


23 


ie 


ie 


30. 


trade growth 1950-1960 


-23 


ie 


it 


91 


ie 


31. 


number of unemployed persons 


ie 


ie 


ie 


ie 


ie 


32. 


percent employed outside county 


-37 


ie 


ie 


ie 


21 


Nofe 


: Entries between -19 and 19 are printed as 


and decimal places 


have 


been omitted. 
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TABLE 3-D-2 



COMHARISON OF COUNTY FACTORS WITH SCHOOL TYPE 



Factor 1 : Urbanization 






X‘ = 3.8 (N.S.) 




SPEDE 


non^SPEDE 




High Urbanization 


22 


212 




Low Urbanization 


21 


106 




Factor 2: Educational Expenditure 






X^ = 8.2 (p C.005) 


High Expenditure 


16 


56 




Low Expenditure 


27 


262 




Factor 3: Youth 






X^ = 0.4 (N.S.) 


High Youth 


18 


151 




Low Youth 


25 


167 




Factor 4: Trade Growth 






X^ = 3.2(N.S.) 


High Growth 


17 


82 




Low Growth 


26 


236 




Factor 5; Public Wealth 






X^ = 3.4 (N.S.) 


High Wealth 


8 


106 




Low Wealth 


35 


212 
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In order to evaluate the distribution o£ the SPEDE schools, strati- 
fications of the counties were derived from factors. Fw ea^ factor, 
the counties were divided at the median into *high and low • 
tabulations appearing in Table 3-D-2 were then 

factor, the corresponding tabulation gives the n^ber of SKDE and 

non-SPEDE schools appearing in counties above and n^n-SPEDE 

addition, a chi-square was computed to test whether SPEDE and no 
schools are distributed differently across the factor. 

The chi-square for the second factor. Educational Expenditure , is 
quite significant. This means that if a school is located in a county 
vThich has had a high rate of educational expenditure, then it is more 
likely to be a SPEDE school. This suggests that 

specir.1 qualities, for they have a pre-SPEDE history high educational 
expenditure— i.e., of manifest concern and willingness . 

secondary education. t.a alternative interpretation is that areas with 
high expenditure have been building new schools, and administrative 
Innovation is easier to effect in new schools. In , 

differences doubtless correlate with factors such as teache q y 
community involvement. Ihe SPEDE schools may not be crasidered repre 
sentative of the population of schools in the state. ®*®'®f°”' 
of then alonn is inappropriate in terms of the purposes of the present 

research. 

The results of this analysis suggest that a stratified 
methodology nay well be useful in extending to a national scale th 
present research into high school sise and educational 
The choice in such extension is between selecting schools where Mchine 
readable data are available, thus greatly restricting the generaliaability 
of the results, and sampling schools according to a stratification pi , 
thus necessitating the generation of raw data. 
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CHAPTER 4 



DETERHXHATIQN AND PREPARATION OF THE DATA 



Nbdiflcations of a research plan are to be expected but are not 
always reported in sufficient detail to be useful in subsequent efforts. 
In the course of acquiring and processing data in this research project » 
several unanticipated developments appeared which called for such 
modifications • 

Section A contains an account of contacts and relationships estab- 
lished in the search for data. In Section B is an account of problems 
which arose in the processirig of educational data. Section C is devoted 
to the problems encountered in the processing of census data* 



A* Searching for Data 



The search for data was in essence a search for persons sufficiently 
close to the data who could provide substantive answers to questions of 
data availability and format* However, they needed to rank sufficiently 
high in the administrative str'tcture of the system to reduce the number 
of persons involved in obtaining access to the data* 

In each of the two states, official contact was initially made with 
the Superintendent of Public Education. With his approval, the person 
identified by him as responsible for data processing was contacted. Through 
this latter person, a member of the data processing staff was assigned to 
be the liaison with the research staff. 

In addition, a separate relationship was established with the D.S* 
ftnreau of the Gmsus* Iietters were written to professors of sociology 
in each state asking for information regarding any recent analysis or 
compilation of relevant socio-economic data in that state. 

Several persons were retained as continuing consultants to the project. 
These persons had no direct association with either state* They were 
called upon as the need to settle either substantive or methodological 
questions arose* Tables 4-A-l and 2 list the persons contacted and the 
data manuals obtained. 



/ 



TABLE 4-A-I 

PERSONS CONTACTED IN THE SEARCH FOR DATA 



I. State liaison persons 

A. Iowa: 

1. Dr. Ralph Van Dusseldorp, Director 
Iowa Educational Infomation Center 

2 . Dr. E. Gordon Richardson> Assistant Director 
Iowa Educational Infomiation Center 

3. D. J. Gilliland, Reorganization Consultant 
Iowa Department of Biblic instruction 

4. Professor lyle Shanncm, Chairman 
Department of Sociology and Anthropology 
The Iftiiversity of Iowa 

Professor John Hartman 
Department of Rural Sociology 
Iowa State Uhiversity 

B. Florida: 

1. Robert Sims, Systems Coordinator 
Florida Department of Education 

2. Everett Yarbrough, Systems Coordinator 
Florida Department of Education 

3. Dr. Archie Johnston, Systems Analyst 
Florida Department of Education 

4. Professor T. lynt Smith 
Department of Sociology 
Uhiversity of Florida 

H. Wational^Regional liaison persons and continuing consultants 

A. Continuing consultants: 

1. Dr. David E. Wiley 
Professor of Education 
Uhiversity of Chicago 

2. Dr. David Nasitir 
Professor Sociology 
University of California 



IABI2 4-A-I (cofit'd) 



National-Regional liaison persons: 

1. Dr. Thomas Johnson > Assistant Director 
CEMREL 

St. Ann, Missouri 

2. Dr. Charles E. Ball, Director of School Studies 
Project Talent 

3. Dr. Charles Bonjean 
^ofessor of Sociology 
University of Texas 

Herman F. Miller, Chief 
Population Division 
U.S. Bureau of the Census 

Levis J. Conger 

U.S. Bureau of the Census 

6. Marshall Turner 

U.S. Bureau of the Census 

7. Dr. Byron Ibinson, Director 

Center for Comnunity and Regional AncQysis 
The Ohio State Uiiversity 

8. Arnold A. Beyl, Director 

Division of Data Sources and Stcmdards 
U.S. Office of Education 

9. Dr. Richard Powers, Chief 
Operations Analysis Division 
Economics Branch, U.S.O.E. 



TABLE 4-A-2 



DATA liANUAl^ OBTAINED 



i. 



Format Book 
content: 



lile documentation for the thirty*eight files 
(explained fully in Section B of this chapter). 



2 . 



loin Educational Data Bank I .er*s Manual 

content: data information revealing some items ot 

available data. 



3. Iowa GardPac Administration Manual 

content: instructions to principals and proctors tor 

administering a set of student information forms. 



4. Instructions for Gompleting the Pupil Inventory 

content: instructions to students for completing a 

CardPac questionnaire. 



5. Iowa CardPac Student Questionnaire Summaries 196-^ 

content: statewide summaries of students responses to 

a questionnaire. 



6. Instructions for the Iowa Professional Employees Data Sheet 
content: instructions for completing a teacher 

information form. 



7. Iowa Educational Directory 1965-66 School Year j 

content: listing of districts, schools, and staff including 

identifyis^ code nnbers. 



8 . 



U.S. Census of Housing: 1960 

Availability of Published and Unpublished to ta 

content: information regarding ^ailability cf data items 

oer census unit. 



9. U.S. Census of Population: 1960 

AvalUbllity of tublUhed aad Unpublished Ds^ 

content: information regarding availability of data items 
ner census unit. 
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10. U.S, Census of Population: 1S60 Final Report PC(1)-17B 

General Population Characteristics for Iowa 

content: published tables for the large Iowa census units 

giving data on age, race, household relationship, 
marital status. 

11. U.S. Census of Population and Housing: I960 

Geographic Identificatloo Code Scheme for Iowa PHC(2)*17 

content: names and code numbers for Iowa census units. 



12. U.S. Census of Population and Housing: 1960 

PHC(l) Final Report for the five StiSh cities of Iowa 

content of each of the five reports: published tables of 

data for census tracts, 
map of the SilSA with 
tract lines. 



13. U.S. Census of Housing: 1960 Series HC(3)-158 

City Blocks of Cedar Rapids, Iowa 

content: published taule of data for city blocks, map of 

the city with city block lines* 
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3. Processing the Educational Data 



Both Iowa and Florida possess what is termed here an "educational 
data bank center." These centers store and maintain vast aiiounts of 
data in uachine readable form. In attempting to use these data banks 
for research purposes, problems arose due to the nature and activities 
of the centers themselves. "Processing" was necessary to make the data 
usable. 



The School Units 



Prelude to processing, it was essential to discover the data units, 
the entities to wliich information ascribed. A school unit is a single 
class of organizations or individuals described by educational data. A 
study of the Iowa data documents yielded for Iowa five school units: 

County 

District 

School 

Teacher 

Student 

The units District, School, Teacher, and Student are hierarchical 
units and posed no manipulative problems. In most cases, the district 
contained only one hi^ school, and in these cases, the district and 
the school are equivalent units. In the case of the multi-high-school 
district, the school is considered as a school attendance unit (see 
Appendix B for more detailed definitions of district and school attendance 
units). The district and county are not geographically coterminous, but 
since the school is the focus of this research, and since educational 
data describing counties was of little or no value, the problem of dls- 
consonant units did not appear within the area of processing the 
educational data. 



Problems with the Data Bank 



An educational data bank center receives its funds from the state 
which it serves. Accordingly, the activities of the center are limited 
and oriented to very practical and essential applications— student 
transcripts, teacher record keeping, financial accounting, etc. These 
are not research applications. 

From peripheral contact with the centers « a simplified model of their 
data collection and storing procedures was inferred; this is depicted in 
Figure A-B-1. The source forms, for example, might be teacher personnel 
forms. Accompanying the actual forms would be perhaps a general infor- 
mation pamphlet meant to inform administrators and teachers of the purpose 
of the form; instructions for filling out the source forms might be 
contained in a separate manual or might appear on the source form itself. 
The completed source forms would be collected and coded on cards, for 
exaL':ple. This original machine readable medium contained all the infor- 
mation that was coded froc.'. the forms. 
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transfortiations were perfonaed upon the original information! 
the order of the items of information might be slightly altered during 
the transformation; items of information might be deleted or recoded; items 
of information from two types of source forms might be merged together* 

Each transformation was designed for a very specific application, and each 
resulLed in a second generation medium««a file* Further transformations 
might be performed on and between second or first generation media to 
produce third generation media, and so forth* What resulted was a diffusion 
of items of information and of transformed but equivalent items into many 
files* 



Files were documented by data bank center personnel, but the documenta- 
tion was minimal and designed for use mainly within the center* The first 
problem that arose then, was that no organized information was available 
to adequately answer the questions, *Wiat information is stored? How is 
it coded?" In lieu of the transformations performed, a third question had 
to be made answerable* "Where is a particular item of information to be 
found?" The remainder of this section is devoted to a description of the 
procedures used to make these questions answerable— procedures used to make 
the data hank usable for research purposes* 



The Iowa Format Book 



The Iowa educational data bank center had thirty-eight files in its 
bank when a request for data information was made* Received from Iowa 
for each file was a set of internal documents (see Figure 4-3-1)* Bach 
file was documented by several pages including at least an identifying page 
and an information layout form* Sometimes included in the set was a sample 
of the source forms previously mentioned (see Appendix B, pp* 2-5 for an 
example of file documentation)* The collection of the thirty-ei^t file 
documentations constitutes what hereafter is referred to as the "Iowa 
Format Book." The files of the Iowa Format Book were then analyzed for 
data content* 



The Process of Content Analysis 

Through the process of content analysis, data uere^^^ usable for 
research purposes* The analysis was concerned with "identifying" and record- 
ing the location of basic informational units contained in a collection of 
data. This basic informational unit of data is called an "item*" For 
each item within each file, a Data Item File Sheet was completed (see 
Appendix B-Foldout for an example of a completed File Sheet)* The following 
information was recorded on a File Sheet: 

1* The school unit which the item described, i*e*, student, 
teacher, school, district, or county* This unit applied to 
all items within the same file* 

2* The abbreviated title of the item as it appeared on the 
information layout form* 
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3. A "key-word title" which sunmerized the character of the item. 
The keyword title included: 

a) the school unit, 

b) words giving the general nature of the item, e.g., salary 
or enrollment, and 

c) specific infotioation about a) or b). 

The title would read as an ordinary nominal phrase. Syonymlza- 
tion within the title without loss of conciseness was desirable. 

4. Any further description of the item. 

5. The source used for the Item's description (the sources were 
either the file documentation or one of the information manuals 
listed in Section A of this chapter). 

6. The coding system for the item if applicable. 

7. The source used in determining item six (#6) 

8. Physical information-number and name of file. 

Appendix B gives a detailed illustration of the procedure. Over 
1000 items were processed in this manner. The File Sheets were then 
assigned identification numbers and certain parts of the item Information 
were punched onto sets of computer cards (see Figure 4-B-2). The punched 
Information for an item is called a "recoil", and its identification 
number, a RECID number. 



Indexing the Item Information 

Essentially, the Data Item File (collection of Data Item File Sheets) 
was a library of information pertaining to items of data. An index was 
necessary in order to locate information being sought. For instance, a 
standard book library will have its books identified with a Dewey Decimal 
System number, and will have a card catalogue with subject index, author 
index, and title index. Without these indexes, the library of books 
would be almost unusable. An unindexed Data Item File would be similarly 
unusuable. 

A computer program was developed to automatically produce Indexes 
from the punched version of the Data Item File. Examples of these 
indexes are given in Figures 4-B-3, 4-B-4, and 4-B-5« The produced 
indexes served a purpose analogous to that of the standard card catalogue. 
Immediate answers to the questions of data availability, coding, and 
locations of items were enabled through use of the Indexed Data Item File. 

1) KWIC index. Figure 4-B-3 

The KWIC index is an alphabetical listing of all non-trivial words 
(key words) contained within the key word titles of the File Sheets. 

"KWIC" stands for Key Word In Context, and each line of the index has 
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the key eord centered within e portion of ite or igin al context* Om line 
eppeers for each occurrence of a key word, and the RECID (record identi- 
fication nnod>er) of the titled item appears to the rigjht of the 1^®* 

The fall entry can be found by locating this RECID .number in the RECID 
index. Figure 4-B-5. 

A nnc index lends itself ideally to the clustering of available 
data. The resear cter may have a theoretical concept in mind and 1^^® 
as to what might be 'Measures of that concept* He can instantly find 
«v.' liable and relevant items of data by referencing words in the KHIC 
index that are descriptive of the measure* He may be motivated to new ^ 
ideas for measuring by a directed search throng the ind e x . For example, 
if information about high school seniors is desired, one could find all 
the references with the word •'senior** in the title by simply finding^the 
word* After reading the title segpmnts, one notices that "graduates 
is also a good place to look for this information. 

2) Author index. Table 4-B-4 

This index gives a list of all the items arranged in order of file 
and of physical location within the file* The library card catalogue 
author index is analogous to this index: items ordered by location within 

a file are analogous to books ordered by title within an author* 

The first line of each three-line entry is the file n a me followed 
by a slash followed by the item's physical location* The RECID again 
appears on the ri^t* The second line is the item title, and the third, 
the schodl unit* 

A reconstruction of the I ow a Format Book including references to 
complete Item documentation was obtained through production of the Author 
index* 

3) RECID index. Table 4-B-5 

iivigjit in ordered according to its record identification number, 
and each entry contains all the Information punched for the item* Of 
course, the actual File Sheet may be located by use of the RECID nuod>er, 
and full docuomntatlon including coding can be found there. 



Smlmctip K. the Data 

produced made answerable the cpiestions about availability 

of data, coding, and location. A study of the indexes revealed that five 
files were exhaustive, i*e*, five of the thirty-eight files contained 
practically all the information in the bank* These files were requested 
and were received from the Iowa Information Center. Four of the five 
files were contained on reels of magnetic tape; the fifth, on cards* 



Finalising the Item File 

Since only a sub-set of the entire bank of items was acquired, the 
Data Item File had to be modified accordingly and then reindexed. Prom 
the original set of punched records of item information, and File Sheets, 
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a selection pertaining only to the files received was made. Certain 
additions had to be made since the data received were of a differei^ 
school year t^**n were the content analyzed file docuiaents. Other nodifi* 
cations to provide for a precise physical location of items had to be 
included for later programming purposes. 

The finalized selection of punched card records was then run 
through the indexing program ^ and final indexes were produced. 



C. Processing the Census Data 



Resolutions to the problem of data and the problem of data units 
were brou^t about through the processing of census data. The first 
problem, that of data, was resolved by applying similar proceduMS to 
census data as were applied in processing the educational data (section 
B of this chapter). A comparatively small effort was required in this 
area of processing, however, since census data Ate highly organized and 
well documented. The second problem, that of data units, was concerned 
with bringing data available for census units into consonance with data 
available for school units. As was previously mentioned (Chapter 1. 
Section C). school district boundaries do not follow any census unit 
boundaries. Thus, transformations had to be performed upon census unit 
data to enable cosipatibility with school unit data. The 'Snapping of 
the Iowa school districts provided the basis for this transformation. 



Census Dhits 



A complete list of census definitions is given in Appendix A. but 
those listed below are sufficient for an understanding of this section. 

Counties 

Tho primary divisions of the States except for Louisiana 
where th e divisions are called parishes and in Alaska uhere 
data are collected for election districts. 

Hinor Civil Divisions 

The primary political divisions into which counties are 
divided. Uhere more than one type of primary division 
ex i s ts in a county, the bureau uses the more stable division 
so that comparable data are available from census to census 
(nchool. taxation, election units, etc. are not considered 
stable). 

Standard Metropolitan Statistical Areas (SH5A) 

A unit whose entire population is in and around the city 
whose activities form an integrated social and economic 
system. Except in New England, an SiSA is a county or 
group of contiguous counties which contains at least one 






i&>nf"t 



city of 5C,JC0 inhabitant! or nore or “twin citteo'* with 
a coiabined population of at laaot 50*000# In addition to 
the counties containing such a city or cities* contiguous 
counties are included in an SHSA* if* according to certain 
criteria* they are essentially taetropolitan in character and 
are socially and economically integrated witl) the central 
city. Criteria are listed on page viii of PC(1). 

Census Tract 

tracts are ssiall areas into idiich large cities and 
adjacent areas have been divided for statistical purposes* 
Tract boundaries were established cooperatively by a local 
cOEdittee i»nd the Bureau of the Census* and were generally 
designed to be relatively uniform with respect to population 
characteristics* economic status* and living conditions. 

The average tract has about 4*000 residents. Tract boundari.es 
are established with the intention of being maintained over a 
long tisae so that comparisons may be made froD census to 
census. 



Place 



Concentrations of population* regardless ot the existence 
of legally prescribed limits* powers* or functions. Host 
of the places listed are incorporated as cities* towns* 
vill 'ea* or boroughs* however. The larger unincorporated 
place outside the urbanized area was delineated and those 
with a population of 1*000 or nore are presented in the 
manner as incorporated places of equal size. 



Enumeration District 

A small area assigned to an enumerator which must be oaonrassed 
and rscorted separately. In moot oasec an BD cpoto^ approxi- 
mately' 250 housing units. The boundaries far the BD s ror the 
imst part follow such features as roads* streets « railroads* 
ctreams* and other clearly defined lines idilch agy be easily 
identified by censoe enooeratorc in the field sod often do not 
eooforc to the boundaries of political unite. 



Availability and Selection o f C3nsus Data 

Census information is divided in three ways: major subject 
geograidiic areas* and retail trade areas (see Appendix A, p. 5). Of the 
eight major subject fields* population and housing data were considered 
to contain the best selection of items which could characterize a 
conerunity. 

Booklets describing data availability were readily obtained along 
with other census publications. These are listed in Table 4-A-2. Data 
obtainable in the form of magnetic computer tape reels are available at 
small census unit levels— enumerst Ion districts* census tracts, and minor 

civil divisions. 



The data are of one of two types according to the method of 
collection— count data ano sample data. The count data arabased on a 
lOOX sanple while the sample data are based on a saoplie of approxi* .lately 
25Z: calculations are conducted by the Census Bureau to bring sample 
figures up to an esti late of total population figures. No sample data 
were tabulated for enumeration districts because of sampling variabil y 
at the level of this smallest of census units. A list of the subject 
categories of data items available for count data and for sample data 

is given below: 



Subjects Available for 
Count Data 

(evailable for enumeration 
district, census tract, minor 
civi'l division) 



Subjects Available for 
Sa^iple Data 

(available for census tract, 
minor civil division) 



1. Age 

2. Household Relationship 

3. Marital Status 

4. Race 



1. Age 

2. Class of Worker 

3. Country of Origin 

4. Emplo 3 saent Status 

5. Income of Families and 
Unrelated Individuals 

6. Industry 

7. Marital Status 

8. ISarried Couples, Families, 
and Unrelated Individuals 

9. Means of Transportation 

10. Nativity and Parentage 

11. Occupation 

12. Place of Work 

13. Residence in 1955 

14 . Res idence : Urban-Rural , 

Farm-N on- farm 

15. School Enrollment 

16. Years of Sc'aool Completed 



So, althou^ the smaller unit was desirable, both sets of data 
could not be obtained for enumeration districts. Ar. 
in the mapping of Icwa school districts, a decision x/ita large ramifi- 
cations was at hand in selecting the data. Even though non-dollar costs 
were of main iateref,t in the research, actual dollar costs had to be 
considered in the course of research activities. Tlie considerati«s of 
time and cos : aade clear the decision as to \;hich units of availability 
the data should be chosen. The UCD (minor civil cKvision) and census 
tract units ware the only practical choice. This cooica also meant a 
wider variety of cocomunity data— the entire scope presented above. 



Indexing the Census Data 

Along with the magnetic tape reels of actual data, two technical 
memoranda explaining the record layout and content of the sample 
count data were received. From these two documents, a census item file 
was constructed by completing a census data item file sheet for each 
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item contained on the tapes; the coupleted file sheet contained the 
title of the item and its exact physical location on the computer tape. 
This information was punched on sets of computer cards and indexes of 
the census data information were produced automatically. 

The procedures presented so far in this section have closely 
paralleled those of the previous section, processing the educational 
data. And at this point, with the indsx refined, a rieans of instantaneous 
access to information concerning both educational and coomiunity data 
existed. 



Disconsonant Units 

As ms stated earlier, the school unit is not geographically coter- 
uinous with census units. The "disconsonance" is best disused by 
classifying the school district according to nultiple or single high 
school district, and location of the district inside or outside an 
Table A-C-l lists the four cases. The classification is a useful aW to 
discussion, but is imperfect since school districts may have ^ in ^ “ 
norrcounties and, thSefore, be neither entirely inside or 'xnitslde an SUSA. 

Case I. Single High School, Non-SliSA 



Geographically, high school attendance areas for disteicts with one 
high school nnit are equivalent to the district itself. Th<«efore, the 
major educational data unit in this case is the district. The major 
census unit is the township, sometimes referred to as the MOD (minor 
civil division). Figure 4-C-l represents a section of central loiro 
and exemplifies Case I. Note that the school district boundaries do 
not follow the township lines. The majority of the disconsonance was 

of this type. 

Nothing was contained in either the census data or the educational 
data to indicate the correspondences between iiCD's and school districts. 
Ilaps displaying both units had to be c-jdied in order to 4x*cover pro- 
port icxial correspondences. 



Case II. liultiple High School District in SliSA 

The state of Iowa possesses five S14S^* cities, i*e*» five cities 
within tracted araas. The school district in each of ® 

multiple hi^ school district. In this case the major education^ data 
unit was the high school attendance area, and the census ^ 

the census tract. Figure 4-C-2 is a census tract map of Cedar ®f^®» ^ 
and exemplifies Case II. In most instances, census tracts wre 
contained within one hi^ school attendance area, but in outlyi^ districts 
where tracts became larger, school attendance areas cut across tracts. 

Again, maps with attendance areas drawn on ^quired 

and studied before proportional correspondences could be discovered. 
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Figure 4-C-3 

Disconsonance of High School Attendance Area and Census Tract * 







Case III* Single High School District in SliSA 

The districts outside the SilSA city but within the SMSA county were 
all single high school districts* The school unit v^s therefore the 
district, and the census unit, the census tract* The outlying tracts 
were as large or larger then HCD’s, and Figure 4-C-3 gives an idea of 
the relative sizes of the disconsonant areas* 



Case IV* Multiple High School District, Non**SMSA 

There was only one occurrence of a multiple high school district 
in a non-tracted area* The disconsonance was between high school atten- 
dance area and HCD* Figure 4-C-l is again useful to understanding the 
type of disconsonance if the shaded area is considered to be a single 
district with two high schools* 



A Happing Solution to the Problem of Disconsonant Units 

How can census data be accurately allocated to schools or school 
districts if the proportional correspondences are not recorded except 
in pictorial form? After much deliberation, it was decided that the only 
practicable approach to the problem was to establish which census tracts, 
towns, cities, townships or parts of them are included within the var- 
ious school district boundaries* The cities, towns, and smaller census 
tracts posed no particular problem since they were located completely 
within the area of the school unit, but the townships and outlying census 
tracts because of their size had to be parceled to two, three or four 
different school districts* The most obvious basis for splitting up the 
townships was that of proportional area contained in the various school 
districts* But was this basis justifiable? 

Professor Randall Sale of the University of Wisconsin Geography 
Department was asked to do sorae research on this question* Professor 
Sale selected three townships in Iowa with widely differing terrain and 
had dwelling units plotted on the maps of the townships to test an 
assumption that population is a continuous and uniformly distributed 
variable* After plotting the areas and studying the relationship between 
the terrain and the population distribution, evidence indicated that 
defining population on the basis of area was a fair assumption* In 
other words, one can draw lines through Iowa townships in almost any way, 
and the population will be divided In much the same way as the area* 

Keep in mind, hov^ever, that this method is used with townships in rural 
areas or with census tracts in highly urbanized areas, if partitioning 
is necessary there* 



The Idappine Task 



The U*S* Census Geographic Code Scheme Booklet (Table 4-A-3,M) 
contained the names and identification code numbers of all the geographic 
census areas in Iowa, e*g*, counties, townships, towns, parts of towns In 
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a tovmship, etc. With the aid of a computer, forms depicted in Figure 
4«C*4 were produced* The next step was to travel to Des Uoines with the 
forms, study a farrago of school district maps prepared by county super- 
intendents, and record the proportional assignments of area (the italic 
of Figure 4-C-4). Census tracts for Iowa*s five SMSA*s were assigned 
similarly to high school attendance areas. 

More than four hundred townships were apportioned in approximately 
one hundred man-hours of work. Local people were trained on the job in 
Des lloines and iTere supervised by a research staff member. The accuracy 
of certain of the areal apportionments was questionable, however, because 
of lack of definition in some of the maps submitted by the county 
superintendents . 



Transformation of Census Data 

Basically, two related lists had been compiled; the first was a list 
of all the geographic areas in Iowa— areas 'tlhich had been assigned to 
school districts; the second was a list of actual census data units— 
units for which data were stored on magnetic computer tape. 

Figure 4-C-4 is the list of geographical areas and proportional 
assignments for Wright County, Iowa. These areas are displayed in 
Figure 4-C-5with school districts superimposed. 

The areas preceded by an asterisk in Figure 4-C-4 are also actual 
census data units— units for which data were stored on tape. The units 
were of three kinds: 

If the township contained cities or parts of cities of population 
2,500 or greater, the units for the township were 

1) Cities or parts of cities of population 2,500 or greater. 

2) The remainder of the township, i,e., the township minus 1) 

If the township had no large cities, the unit for that township 

was 

3) The entire township. 

With these two lists, enou^ information was present to automatically 
determine the census unit composition of each school unit. For example, 
from looking at Figure 4-C-4 one sees that school district 594 is composed 

of; (see also Figure 4-C-l) 

97% of the remainder of Belmond Township 
:*100% of Belmond City 
•'r 22% of Blaine Township 
+ 30% of the remainder of Grant Township 



A computer was programmed to perform a more complicated set of opera- 
tions than those indicated above, and the problem of disconsonant units was 
finally resolved. Using the results of mapping as a basis for the transfor- 
mation, census data were allocated to the proper school districts. A final 
computer tape resulted idiich contained the proper community data to describe 
each district. 
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Identification 

Code Numbers District % of Area District % of Area District % of Area 

of Area Area Name Numbers in District Numbers in District Numbers in District 
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Figure 4-C-A 

Proportional Assianments of Census Areas To 
School Districts 
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CHAPTER 5 



SPECIFIC ANALYTICAL AND THEORETICAL TECHNIQUE 



la thi. chapter the pettlcalar theoretic 
operatloos are detailed* In Section A, a theory this 

St«en high achool size and student 

theory, clusters (characteristics, processes, qualities, etcw 
defined rdilch together manifest a conceptuallMtlon o 

problem. In Section B. specific Sl^s^ldL^ measure- 

clusters m;.^t be measured and on the degree to *lch such we« 

ment can be realized vith the presently 
TZ ^seated a partitioning and grouping 

indicators, into the defined clusters. ^ !“tmoL tota 

S::‘s^c«i:^''^^-U*s «d ^“r^etatlons of the result, of the 
Operations are presented In Chapter o. 



A. A Theory for Conceptualizing Student/School 

Interaction 



The theory presented in *^1. 
eral goals: it was necessary to establish a t^ore^ca 

understanding of the processes and factors at work in , 

between high school size and non-dollar costs, first, ^ order to 

evaluate the adequacy of the dal^ ba^ of^n^^^erles of defined 

perfcTm actual data analysis. to -aspects of the 

processes chan acterj.j tics whxch «,-r,refl*e'* .are defined in 

subst. ntive oroblem. The characteristics a. p ^ ^ include *'high*' 

the first goal, e-/aluatlng the adeonacy of the data ban.t cw 
charccterisclcs ^d the seconi goal, setting 

^*data ^ly^s, the Sts sLo^*^°' 

^nSulr^ tlllsre^'us^opt;- for referring to the defined 
characteristics and processes of the theory. 

the theory is intended to encompass the ent^e 
problem, it draws “P»" ^ 

^“.Lsrsx t:'.;::::, u - ,1 - 




guided Ite’development. First, there Has cOTcem at j 

rte theory for describing clusters iihich f defLed 

student growth and for avoiding the 

for the purposes of adLiinistrative siEplicity and utility, ror . 

the standard financUl cUssifications within which sc^ls 
not considered iiiportant aer 50 in describing the qualities o 
because they affect the students only as secondary causes. Tte 

iSl^Ltes that sufficient descriptive “2heo^. 

the esTCCts of the problem. In certain less crucial parts of the theo^, 

in fact, the major concern was for sufficient rSated 

that more meaningful (prlcary) clusters f 

to the defined austere, (this is explained in more detail later.) 

Four general areas or superclusters are defined athta ?*** 

Commmlty Input. School dediation. Student Outpo^ ^ Sc^l 

The presentation of the theory is made in five ^rts. - 

dusters are initially defined and their general causal ^ 

are explained. Then the four superclustcrs are presented -rtni-aioa 

After^he last page of this section there is a foldout P**® 5®“. ^ 

a iSgrSia^am; Figure 5^-1. of the clusters and 

ships of the theory. It is suggested that the diagram e o 

While the section is being read. 



Overview of the Theory 

The unit ultimately under study is that of m ^tvidual hi^^s c^l 

student, and the process ultimately under »^F S^tlv 

nrowth. A student grows in an environment i*ich includes his family, 

his peers, his church, his town, and his sdiool. For the "* 

establishing a theory of a student’s growth, four complexes of factors 

are identified: 



(a) School Mediation, the processes which occur within the 
high school; 

(b) School Manipulation, the secondary factors which affect 
School Mediation; 

(c) Community Input, the non-school influences in the environment 

(d) Student Output, the effect on the student. 



Their general interrelationships are diagrammed in 
four superclusters correspond to the four main 

The arr^ connecting the boxes indicate the interteUtionships of the 
superclusters . 

Godounity Input includes family and home influences and the e «ect 
of pre-hlgh sAool growth. Th«efote, 

represent toput characteristics of the student body in the high school. 



o 



- 82 - 



TABLE 5-A-l 



THE FOUR AAAIN SUPERCLUSTERS AND THE THEORETIC INTERRELATIONSHIPS 
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For exaaple, the quality of the elementary schools in a comnunity 
influences the initial scholastic abilities of high school students. 

But also, the comaunity exerts a continuing and immediate influence on 
the students while they r»re in high school. For example, the existence 
of a library in a community may influence the academic growth of students. 
There are certainly factors in both the initial and the continuing influ* 
ence of community which could be considered primary in their effect on 
students. Family heritage, elementary education, and community social 
process could conceivably be combined and categorized according to factors 
which directly affect the nature and growth of a student. However, for 
reasons explained later, the Community Input factors are defined rather 
independently of student relatedness ^ communities are characterized 
according to demographic variation. 

School liediation is a particular complex of Influences within a more 
general set of community influences, but the present development of the 
theory of School 1-lediation has been isolated and is more exacting. 

School liediation consists of the forces exerted on, or at least directed 
at, the student and of the concomitant reactions of the student. A 
complete understanding of the relationships between high school size and 
student growth depends on an understanding of the exact nature and^ 
strength of the forces and reactions of School ilediation as they differ* 
entially relate to school size. The clusters which are developed in 
the theory for School Ilediation are intended directly to represent 
processes which primarily cause student growth. 

The Student Output of the community and high school are the qualities 
of the students that emerge from them. The concern is for the character- 
istics of the individual student— his achievements, aspirations, and 
his emotional stobill^*for further growth. Because this study relates 
primarily to the mediation provided by the school, the outputs associated 
formally or informally with the school are emphasized. For example, 
ability to perform a job consonant with the opportunities of the community 
might be an output. Whether or not a student obtains a job might involve 
factors outside the scope of the theory. 

The general causal chain of the theory has now been outlined: 
Community Input determines* the qualities of the students upon entrance 
to school and also exerts a continuing iki>uence; School Hediation is a 
particular influence and is to be examined in detail; Student Output is 
the dependent result. The fourth is a complex of potential factors 
which are grouped under School Manipulation. Xn determining how to 
design, build, or run a high school there are certain variables which 
may be manipulated by school administrators or boards of education. 

Color of the walls, student /teacher ratio, and number of rooms are 
examples. School size, in terms of the enrollment, is the School Manipu- 
lation variable upon which the present research is focused. It is to be 
emphasized, however, that School Manipulation-*and, in particular, school 
size— are not considered part of the primary causal chain. Essentially, 
school size does not affect Student Output; rather it affects School 
Mediation which in turn affects Student Output. It has been necessary 
to establish the primary causal chain of Input, Hediation, Output in order 
to determine the ways in idiich size indirectly affects Student Output; in 
order to investigate the "non-dollar” costs of size. 
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In the rest of this section, the clusters within each of the four 
superclusters will be defined. One further general note should be made, 
however, on the problem of choice of unit. The theory ultimately is to 
be applied to schools; that is, the clusters to be defined are to be 
measurable for schools, each school having a single value. Yet as 
mentioned, the ultitaate unit of the theory is the individual student, and 
the concern is for forces ex.rted on him and for his reactions and q^li* 
ties. The unit of application, schools, se emin gly has as its S/alue for 
a characteristic a distribution of numbers across students rather than 
a single number for a school. When this is of concern, as it is for 
School Hediation and Student Output, two clusters are defined for a 
given initial cluster. One describes the mean-level value for the 
individuals in a school; one describes the variation of the value across 
students. It is assumed that a sufficient description is thus made of 
the distribution of values. 



Community Characteristics as Input 



As mentioned, the Community Input clusters represent two parts of 
the causal chain. First, they represent the characteristics of the 
student population upon input to high school, for the earlier growth of 
input students is carried out within and therefore determined in part by 
the community. The whollstlc community clusters indicate the distribution 
of individualistic student characteristics. For example, a high percentage 
of farmers in a community indicates a high number of students aspiring 
to be farmers. The community clusters are assumed to represent indirectly 
the abilities, aspirations, and emotional outlook of the students as 
they enter hi^ school.* Second, the Community Input clusters represent 
the continuing influences of the community, including the family, while 
the students are in high school. (School l-iediation Includes continuing 
influences of the coomunity which have been segregated for detailed 
study.) TSie community clusters are assumed to indicate such non-school 
factors as family pressures and community activities which help mold the 
output characteristics of a student. For example, students may be 
distracted from their high school studies by family economic problems, 
their choice of study may be influenced by particular occupational 
opportunities of local industries, or community social pressure might 
stimulate academic achievement. These clusters are given below: 



A. Population Dynamics 



B. Population Characteristics 



Al. Growth 
A2. Youth 
A3. Density 

B1 . Rural/urban 

B2. Socioeconomic Level 

B3. Socioeconomic Variation 



C. Socialization Characteristics Cl. 

C2. 

C3. 



Social Activity/Involvement 
Social iloblllty 
Poverty Culture 



*Compare the clusters defined for Btudent Output. 
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D. Economic Structure 



Dl. ffealth 
D2. Industry 
D3. Trade 
D4. Farming 



S. Education Resources 



El. Archive Usage 

E2. Educational Involvement 

E3. Educational Attainment 



Coixiunity Input clusters could conceivably be defined according 
to factors xdiich directly correspond to characteristics of students and 
continuing influences of the coinciunity.* Such clusters could be aligned 
according to causes of characteristics and influences; parental encourage- 
ment toward higher adacation, parental intelligence, and a student's 
activity in community social projects might then be clusters. Or the 
clusters could be aligned according to the effects of the characteristics 
and influences; then cognitive ability, emotional stability, and aggressive- 
ness might be clusters. In the present theory, however, the Community 
Input clusters are defined according to "social area analysis" community 
factors; the clusters are not designed directly to represent particular 
characteristics of students or particular influences on them, but rather 
to characterize the general demographic and social qualities of communities. 
This approach was taken because a partitioning according to primary 
causes would have required too great a degree of theoretical understanding 
of the growth processes", and a partitioning according to primary effects 
would not have been suitable for the kinds of data to* which the theory 
was to be applied.** The approach taken is, however, considered adequate 
for two masons. First, the Community Input clusters were to be used as 
contxDlling variables, not as strictly independent or dependent variables; 
that is, their joint effect in controlling the relationship between 
School llediation and Ctudent Output was of interest rather than the exact 
structure of their effect, and for the purpose of theoretical and 
empirical analysis they were to be treated only as a set and not separately. 
Second, it was assumed that by making a sufficiently complex description 
of general community qualities, most of the significant dimensions which 
differentiate communities would be included. It can hopefully be said 
that the more essentially causal dimensions are thereby transformationally 
related to the ones encompassed by the present clusters. If they are 
not, then they do not differentiate between cocmninities, and they are not 
important in the analysis. In empirical terms, it is assumed that the 
present clusters yield measures i^ich account for the same variation In 
School Mediation and Student Output as would a more causal set, and 
therefore the controlling effect is the same. 

The "social area analysis" approach adopted for defining clusters 
led to the adoption of the five main groups of clusters which are explained 
in the following paragraphs. The studies cited in Chapter 2 of the 



*For example, what causes a particular mental trait? 

**For example, mental test data were not available for entering students, 
only for graduating students. 
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report provided initial inspiration for devising the clusters. However, 
an attempt was made to associate purely analytic constructs with their 
potential effect on student growth""i.e.,on input and as continuing 
influence. Each cluster is defined in terms of what a high value on the 
underlying scale xiould mean, and notes are made concerning differentia- 
tion that could be made between aspects of the defined construct. 

Examples of possible indicators for the constructs are presented in 
Section B of this chapter. The presentation of the clusters is summarized 
in the upper left box of the foldout. 

The first group of clusters defined represent the Population Dynamics 
of communities. The population across a state is not uniform: age^ 

distributions, rates and direction of population change, and densities 
of population vary. Such dynamics of population determine in part the 
quality of life in an area: the distribution of age affects the 

influence of adults on youths; sudden increase in population may cause 
overcrowding in elementary schools; low density of population may reduce 
peer group sizes. Three aspects of Population Dynamics are identified 
and defined as clusters in the theory. 

Al. Growth. A high value correspondpi to a large recent 
increase in population; differentiation may be made 
according to cause of growth such as high birth rate 
and immi^ation. 

A2. Youth . A high value corresponds to a concentration of 
population in the low (e.g., under 25) age group; 
differentiation may be made according to the modal 
^alities of the age distribution, such as a high 
teenage population or a large retirement community. 

A3. Density . A high value corresponds to a densely populated 
area; differentiation may be made according to evenness 
of area population distribution. 

The socioeconomic milieu of an area is described by the Population 
Characteristics clusters. Three major dimensions are recognized here: 
the rural/urban character of an area and two aspects of the socioeconomic 
level of individuals. The occupational aspirations of students, for 
example, are especially correlated with the rural/urban distinction, 
farming areas produce farmers. And, for example, cognitive ability corre- 
lates with (but is not caused by) socioeconomic level. Rural/urban 
character is represented by the first cluster. Because socioeconomic level 
for an area is a distribution of individual levels, three clusters for 
it are defined. 

Bl. Rural/Urban . A high value corresponds to a high amount 
of urban-type life; differentiation may be made accord- 
ing to the process of urbanization. 

32. Socioeconomic Level . A high value corresponds to a high 

average socioeconomic status of individuals across the area; 
differentiation may be made according to the types of socio- 
economic class. 
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33 . 



Variation. A high V«Xue corresponds to a 
h < Bh aivMmt of varlatiS in socioeconomic status ot 
individuals across the area: differentiation may be made 
according to modal properties of the socioeconomic 
distribution— that is, to social stratification. 

The next group of clusters concerns the social PIP®**®®* 
munity and is called Socialisation Characteristics. ®® f ^y 
students and their general responsiveness to education “® be a ^ 

the social processes of their community. For example. *«e m^be a 
sense of resignation associated with the social “*® ^ 
stricken areas. There may be a sense of hope in areas where 

much community activity# 

Cl. Social AcHvlt v/lnvolvement,. A high value corresponds 

to a high aSount of sAcial activity and 1 

comnunity-wide basU: differentiation may be made accord 
ing to the type of activity, such as community improvement 

or community recreation. 

C2. Social liobilitv . A high value 

social mobility; differentiation may be made according 
to its applicability to different social classes and accor - 
ing to the change in mobility across time. 

Povertv Culture. A high value corresponds to a larp nu^er 
of povirty-cuiture conditions; differentiation may be made 
according to the kinds of poverty-culture conditions such 
as family incohesiveness and alienation from community. 

By Economic Structure is meant the arrangement of the total econmic 
situafJ r: cosraunity. This helps dete^ine the 

and, hence, educational aspirations of students and is an i^rtant prac 
“ck complex of dimensions along which communities vary. Four clusters 

are identified: 



Dl, 



Wealth. A high value corresponds to general economic 



affiili^ce in a community ; differentiation may be made 
according to peculiarities in the distribution of 
to circulation or acciimulation of wealth, and to wealth 
in public or private sectors of the economy. 

n2. Industry . A high value corresponds to the existence and 
vitality of industrial operations in a community; differ 
entiation may be made according to whether industry is 
or heavy and according to the number of different industries. 

D3. Trade. A high value indicates a large amount of tr®de^^^^ 
activity; differentiation may be made according to father 
trade is wholesale or retail and according to the kind of 
sales force present# 
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D4. Faming . A high value indicates a large involvement in 
farming; differentiation may be made according to type 
of farming products (dairy, cattle, etc.) and to how 
modern is the operation. 

The final group of clusters concerns the Educational Resources of 
a community. The dimensions Indicate the continuing influence of the 
general educational atmosphere of a cocxaunity. The Interest of students 
in education and their prior goals and achievements in education are 
determined in part by the educational standards, achievements, and inter- 
est of the corxiunity members. 

El. Archive Usage . A high value corresponds to high usage 
of various formal and Informal archives of knowledge; 
differentiation may be made according to the types of 
archives available— libraries, museums, etc. 

E2. Educational Involvement . A high value corresponds to 
involvement and Interest in education of various forms 
of a large part of the population; differentiations may 
be made according to the kind of education— academic, 
vocational, avocational— and according to the direction 
of involvement— participation, administration, encourage- 
ment. 

E3. Educational Attainment . A high value corresponds to high 
average level of education or skill; differentiations 
may be made according to kind of education and to strati- 
fication of attainment. 



School Processes as Mediation 

A student enters high school with certain abilities and aspirations. 

He has a personality produced in home, community, and elementary school 
which can be described in terms of his abilities, aspirations, and 
emotional outlook.* When a student leaves high school he may be again 
viewed in terras of his personality. The change in the personality, the 
growth of the student, is partly an effect of the community (as defined 
before). This mediation by the school is to be analyzed in the theory 
in terns of the primary causes of the effect of the school in student 
growth. The processes which go on in the school and which directly cause 
effects in the student are to be defined. For this purpose the total 
school environraent must be considered, including social and cultural 
processes as well as the more formal teaching processes. The processes 
to be <fefined are essentially individualistic; they involve the interaction 
of a single student with his environment. Since the measurement is to 
be made in terms of schools, separate mean-level and variation clusters 



*See the characterization of Student Output for a more detailed 
categorization. 
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are defined for each process considered. This approach was explained at 
the beginning of the section. 

f The School Hediation complex is divided into two groups of factors 
which are intended to be all-inclusive in terms of what happens to students 
in school. A student has certain stimuli directed at him and he obtains 
certain satisfactions from the school environment; i.e., the mediation 
processes are generally considered to be those of stimulation or those of 
satisfaction. 

Stimulation . In the school environment a student is bombarded with 
various stimuli such as facts, ideas, opinions; the stimuli converge 
continuously and often simultaneously upon him. In the theory the stimuli 
are classified according to a three-way design and amount of stimulation 
is the quantity to be scaled. Some sort of classification is necessary 
in order to make possible the evaluation of the different modes of 
stimulation achieved in different kinds of schools. The first classifi- 
cation is the mean level versus variation distinction, already mentioned, 
which adjusts individualistic to wholistic measurement. The second 
classification concerns the substances of stimuli, and the third concerns 
the structure of stimuli. A stimulus is considered to be a conveyance 
of information (substance) directed toward the student in a certain 
manner (structure). For example, in a language laboratory, the vocabulary 
and grammar of a language are the substance of the stimuli directed at 
the student; and recorded criterion voices, recorded student response, 
and immediate teacher feedback make up the structure of the stimuli. The 
average time in the laboratory across students is the loean-Ievel stimula- 
tion, and the variance of time across students is the variation in 
stimulation. The stimulation clusters then follow a design. In the 



Kean-Level 

Variation x Substance x Structure 

following paragraphs, the levels within substance and within structure are 
explained and thus the clusters are defined. They are diagrammed at the 
top of the middle box in the foldout at the end of this section. 

The substance of stimulation is divided into nine levels. The 
first five correspond to the standard formal academic areas: 

FI. English . 

F2. History . 

F3. Science . 

F4. Ilathematics . 

F5. Foreign Language . 

The correspondence to the formal administration classification is, it 
should be emphasized, imperfect; the concern of the theoretical classifica- 
tion is for the information being conveyed. While within administrative 



o 

ERIC 
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classifications, teaching English composition nay 

history courses, in the present classification the »xBllcltlv 

for tte substance of the stinulation, so such 
disregarded-conposition tau^t in a history f 

stinulation. Three other levels also have corresponding fonial course 

worlc. 



F6n Culture. (Music, art, drama.) 

F7, Vocatiogal Training . (Both cocniercial and trade.) 

Vfl w...«cal Bducation . (Inclndlng interschoUstlc coavetition.) 

The last level 1- cooposed of substance of stimuli not covered in the 
Other eight levels. 

F9> Auxiliary . (Including exceptional education, guidance 
and counselling, remedial reading, etc.) 



The substance of a stimulus is directed at the student ^ ^ 
wav in other words, the aedlation occurs in a certain fashion, a stWM 
hM structure. The essential differentlati« among 
is viewed in terms of social interaction. Three levels are 



Gl. Prinarv Social. 

G2. Secondary Social. 

G3. Asocial. 

When a student discusses current events with hw 

stimulation with a Primary Social structure. Ifcen he 

group lecture from a teacher, he receives sticsulation with a Sectary 

facial structure. When he sees a film, the stimulation 

structure. Stimulation of rimary Social stature 

peers; Secondary Social structure from teachers; Asocial structure, 

equipment • 

Now a particular cluater for stimulation be “® * , 

combination of either mean-level or variance, plus one of the nine l^els 
oH^Lce, plus one of the three levels of structure. 
the average time spent in a language laboratory night be an indicator 

for the cluster: 

iiean-Level / Foreign Language / Asocial. 

A clique of students discussing a novel might be an indicator for 

Variation / English / Primary Social. 

The particular classification of structure of stimulation 
designed M that an apparently Important effect of sch^l f “ 

detected. In a large school (or in any ^ a 

of social interaction is different from that in a smaller school. In a 
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small school everyone knows everyone; in a large school there may be 
strangers In a single classroom. In a small school a teacher isay see a 
particular student for several periods In a day* while In a large school 
this may be unlikely. Consequently* the social atmosphere In idilch 
teaching an d learning take place Is different* and this may be expected 
to account for much of the difference between large and small schools. 
Also large and small schools are known to have different possibilities 
In course offerings and In sectioning classes* so that the substance of 
the stimulation may be expected to vary. 

Satisfaction . A student derives or falls to derive certain pleasures 
In school; l.e.* the school environment Is the setting In which some 
experiences (for example* receiving stimuli) are Intrinsically pleasurable 
or not pleasurable. While stimulation refers to activity in which a 
student engages* satisfaction describes the state-of-belng a student 
enjoys at a particular point in time. Aspects of satisfaction are cate- 
gorised according to the kind of Individual pleasure. 

HI. Intellectual. Acquisition of knowledge Is a goal of 

society; learning Is considered intrinsically desirable. 
Learning a new concept or fact* discovering a particularly 
Interesting piece of literature* or successfully complet- 
ing a scientific investigation ml^t be occasions for 
intellectual satisfaction. 

H2. Sensory . Physical activity may be intrinsically pleasur- 
able. Participation In sports or dance courses might 
yield sensory pleasure. Hote that sensory pleasure Is 
Independent of any Information being obtained about 
physical activity; It Involves plsylnga game, but not 
learning the rules. 

H3. Social . There may be Intrinsic pleasure in Interaction 
with others. Friendships with peers or with teachers 
give pleasure: primary social Interaction may be intrin- 

sically pleasurable. Formal recognition, such as being 
elected a class officer, may yield social satisfaction 
from secondary Interaction. 

These three aspects of satisfaction apply to Individual students. 
Consequently, It Is necessary, as with stimulation, to form clusters 
recording to a desv^'n. The design for satisfaction Is 



llean-Level 

Variation 



X Satisfaction 



Since there are three levels defined for satisfaction, six clusters are 
defined in all. For exataple, the presence of both students who partici- 
pate in no spo?:ts and students who participate In ail sports might be 
an Indicator for 



Variation / Sensory Satisfaction. 



the bottom of the middle 



This is a two-way design and is dlagranned at 
box in the foldout# 



The Stud «^nt as Output 

, . , . «„fli-*f-iefi of students upon entering high 

As explained above, the .. described only indirectly through 

school are theoret-.cally and conrMnitv The qualities, though not 

the general characteristics of the aediated in the high school 

explicitly lSder^»e 

environment. The kinds of qualities «-„.id ideally be parallel, 

the Mme £er input « for output: ^“^“^^ttSLriorend 

but the values of the scales are changed ^ ^ course, a complex 

continuing community influence. «n ^ * £ graduates going 

construct. No single indicator-.such w the 

on to college or percentage ioncem for the^total life style 

complexity of results, for there mu^ groups of output clusters 

of the student after hi^ iMliiies^Aspiiati^T^and Emottonal Outlooks, 
defined in the theory are Abilit » J rhe established 

The three groups of clwters “« ^ ^ c<^idere<I desirable 

goals of secondary schools. This problen of uhat adminis- 

because the theory is to apply to the ^ school size, 

trators affect when they manipulate such facto s 

Again, all defined outputs must be ^,^’^^*i,^Lts. 

^“he Sistributions of the characterirtics “* 

Susters are listed in the louer right box in the foldout. 

Abilities. Each studait has certain ability 

Abi Uy~f bi define' “ S'lSu ^ ^ 

note that achieveisent, in the seMe of re^ed as abilitiei 

extrinsic characteristics as socioeco^xc status are regaroee 

Three clusters of abilities are identified. 

n. r»on<tive. (Including intrinsic knovledge, skills, and 
uental traits.) 

12. P~.iw«»ror. (Including physical capacity and attainment.) 

13. - (Including capacity for interaction and socUl 

Standing.) 

The formal academic goals of a school have representation in the Abllltleg 
clusters. 

Aaniratlons^ Students 

‘fit^nT to X“SS^the“1r:“tSre: clusters related to aspUations. 

Jl. Vocational . A high value corres|<;nds to aspiration to a 
hi^ stales or an appropriate job. 
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J2, Avocation. A high value corresponds to the development 
of an interesting hobby. 

J3. Social . A high value corresponds to the aspiration to 
social interaction, friendship, and leadership. 

Coamunity and especially family influences are very strong in developing 
aspirations. Also, the aspirations of a student may or may not be con- 
sonant with the opportunities of the community or the economics of the 
family. A student may aspire to a job which is not available to him 
in his cttxmnity. Or a student may aspire to college when his family 
cannot afford it. Thus when determining the components of aspirations, 
it is important to consider their relation to the abilities of the 
community, family, and even the student himself. 

tonal Outlooks. The attitudes of a student are crucial in 
determining his success in life and, earlier, in school. Of particular 
interest with respect to school goals and facilities is the general 
receptiveness of a student to gro';tfl:h. A student may be hostile to learn- 
ing, or he may be distracted from study by emotional problems. Three 
clusters of emotional outlooks are identified; 

Kl, Autonomy. A high value corresponds to freedom and 

ability to make choices and is an essential ingredient 
of emotional adjustment. 

K2. Distraction / Inhibition. A high value correspoi^s to 
complications in the responsive systems of the mind as, 
for eiiample, through having several simultaneous stimuli, 
or through having blocking reactions. 

K3. Social. A high value corresponds to the emotional ability 
to undergo social interaction and to general adjustment 
and happiness. 

The Emotional Outloolcs of the students are considered important inputs 
for their further education. 



School llanipulation 

All the clusters describing the qualities, characteristics, and 
processes involved in the causal chain have now been presented. School 
manipulation variables are, as mentioned, outside this causal chain and 
are considered to affect the Student Output clusters only indirectly in 
their effect on School lianipulations. The focus of the present study is 
on school size (enrollment) as the single school manipulation variable. 
To complete the presentation of the theory, other manipulation variables 
to which the theory could be applied are listed in the upper ri^t box 
of the foldout. 
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SCHOOL manipulation 
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directly/ but which do not directly 
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EXAMPLE OF SCHOOL MANIPULATION VARIABLES 

1 . School Size (•nrollmont) 

2. School Size ^lant) 

3. Administrative Complexity 

4. Design Factor (e.g./ kinds of chain) 

5. Stu<font/teachf* Ratio 

6. Textbooks Used 

7. Hot Lunch Program 



Mnmunity detennines 
put qualities of the 



II 



SCHOOL MEDIATION 
Consisting of processos and forces 
diiected at students in high school 
and of their concomitont reactions. 






The School Monipulotion 
voric^les effect the school 
mediation 



SCHOOL MEDIAnON CLUSTERS 



STIMULATION 










STRUCTURE 










Primary 


Secondary 










Social 


Social 


Asocial 








G1. 


G2. 


G3. 








MV 


MV 


MV 


c 


FI. 


English 


MV 


MV 


MV 


IJ 


F2. 


History 


MV 


MV 


MV 


w 

g 


F3. 


Science 


MV 


MV 


MV 


s 


F4. 


Mathematics 


MV 


MV 


MV 


T 


F5. 


Foreign Language 


MV 


MV 


MV 


A 


F6. 


Culture 


MV 


MV 


MV 


N 


F7. 


Vocational Training 


MV 


MV 


MV 


C 

m 


F8. 


Physical Education 


MV 


MV 


MV 


B 


F9. 


Auxiliary 










SATISFACTiON 






HI. Intellectual MV 










H2. Sensory 


MV 










H3. Social 


MV 







The school mediation 
[helps transform student 
input to output. 





r 


STUDENT OUTPUT I 

Consisting of the qualities of the 1 

students' output firm high school in 1 

terms of their change from input status I 
aixi their receptiveness for future growth. | 


h 




L 




J 


STUDENT OUTPUT CLUSTERS 


AWUTY 


• 

• 


• ASRRATION : : 


EMOTIONAL OUTLOOKS [ 


1. Cognitive 

2. psychomotor 
1. Social 


• 

M V : 

MV: 

M V : 
• 


: J1. Vocatiorwl MV* • 

• J2. Avocational M V : : 

: J3. Social M V j j 


K1. Autorxxny MV ; 

K2. Distraction M V : 

K3. Social MV : 



ERIC 






hiUMeas. 



B. Ideal Measurement and the Availability o£ Data 



The relatioR'^hip between high school size and educational productivity 
was conceptualized in the previous section. This conceptualization was 
needed in order to define the data analytic operations. It was also 
needed to evaluate the adequacy of the data bank contents for investigat- 
ing the substantive relationship between school size and non-dollar costs. 
The theory is detailed because the relationship is complex. It was 
essential to be able to describe the various effects o£ size and to 
determine the mechanisms through which the effects are transmitted. In 
particular, the social characteristics of the community and school 
environments and the social consequences of size are emphasized. It is 
postulated that the major difference between large and small schools lies 
in their social qualities. 

The Iowa Educational Information Center and the U.S. Census Bureau 
data banks were established for purposes other than the present research. 
The Iowa data bank grew out of the school and student accounting service 
established on a statewide basis. The U.S. Census Bureau data bank was 
established originally for electoral redistricting and has grown as new 
demands for information were made. From the viewpoint of this research, 
neither source makes adequate measurement of the social characteristics 
and processes in its domain. 

This section contains information on how ideally measurement would 
be made for the clusters of the theory. The information is divided into 
sub-sections, one for each of the super clusters— Community Input, School 
lied iat ion, and Student Output. While the paucity of data has led to a 
partial redefinition of the clusters, an argument for the superiority of 
the original clusters is made. 



Measuring Communitv Input 

An example of an ideal indicator for each of the Community Input 
clusters is given in Table 5-B-l. The Census Bureau data were found to 
be adequate; several ideal measurements were achieved and many reasonable 
alternate Indications were found. However, three shortcomings in Census 
Bureau data must be noted. 

First, with respect to educational data (1S65-1936), the census data 
(1960) are older. No research study can use new census data except in 
the decennial year. A special census irould have been enormously expensive 
and using older school data would have been impossible since educational 
data banks were not in existence in 1960. Some 1960 figures, such as 
population, are fairly accurate substitutes for 1965 figures; i.e., the 
correlation across school areas between 1960 and 1965 populations is 
close to one. In the areas where there are discrepancies, it must be 
hypothesized that social and economic changes have taken place. For 
example, if a community's population has increased greatly from 1960 to 
1965 relative to other communities, it is likely that some immigration 
has occurred and that urbanization is taking place. Such changes are 
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TABLE 5-3-1 



IDE;.L 1NDIGAT(»S for COIIMUNITY INPUT 



Cluster 

Al. GroxTth 
A2 • Youth 
A3. Density 

B1 • Rural /Urban 

32. Socioeconomic Level 

33. Socioeconomic Variation 

Cl. Social Activity /Involvement 
C2. Social llobility 

C3. Poverty Culture 

Dl. Wealth 
D2. Industry 
D3. Trade 
D4. Farming 

El . Archive Usage 

E2. Educational Involvement 

E3. Educational Attainment 



Indicator 

7 * population increase 1960-1965 
% of population less than 20 years old 
Population per square mile 

% of population living on farms 
l-Iean family income 
Variance of family income 

X of adults belonging to a social club 

Difference between parent and child 
occupation 

Unemployment rate 

Per -capita personal wealth 
Per**capita industrial business worth 
Per-capita retail sales 
Per-capita farm worth 

Per -capita book circulation 
Per-student PTA attendance 
X of adults who are college graduates 



- 97 - 



o 

ERIC 

MMilRlffTIfliU 






very nuch related to the social anJ econonlc environment of a community* 
In particular 9 they must be considered related to school size and non- 
dollar costs* A student who has recently moved into a community makes 
friends* The processes involved may be different in large and small 
schools* This reasoning can easily be extended to other variables* It 
implies that measurement error in using old census data is likely to 
correlate with the essential variables of the analysis and, therefore* 
is likely to cause distortion in the results** 

The second major problem with census data concerns the measurement 
of the social characteristics of a community* For the purpose of the 
present research, measurement would ideally be made of social processes 
as they occur in a community* But the Census Bureau does not make 
social inquiries, such as asking persons to what organizations they 
belong* In defining the social climate of a community— the quality of 
the social environment and life of its occupants— the absence of direct 
social inquiries is a limitation* With respect to the present problem 
of examining the relationship between school size and non-dollar costs, 
defining this climate is important: the social life of the community 

at large overlaps with the social life of the school* In the present 
theory, the social life of the school is considered the crucial interven- 
ing variable between size and non-dollar costs* 

The third point which should be made about census data is that the 
census is an inventory of individual characteristics* It is not intended 
to describe the institutions of a connaiinity* mils makes it difficult 
to measure the economic structure of a community* Using census data, 
economic structure must be inferred from the occupations of the people 
of the community, and this probably does not yield an adequate descrip- 
tion of the dynamics of the economic structure* Similarly, public 
Institutions— such as churches, libraries, and schools— must be described 
indirectly through certain characteristics of individuals. Essentially 
no information is available on the avocational interests satisfied by 
institutions such as social and service clubs* This is a limitation 
in terms of the need according to the theory to define the continuing 
effect of the community on the social and educational growth of students 
in high school* 



Measuring School liediation 

There are difficulties in measuring from educational data bank 
information defined for School lied iat ion by the clusters* To overcome 
these difficulties, a simplification of the cluster scheme was adopted 
for performing the data analysis; this simplification is presented in 
the next section* Some further notes are presented first on why the corapkse- 
ity of thS original scheme should be considered necessary for accurate 
description of the process of mediation by the school* 



*Using 1950 census figures for the purpose of measuring and projecting 
such change would have been prohibitively expensive. It would probably 
also have yielded inadequate projections since many .school areas are - 
small geographic areas capable of rapid change. 
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In the original scheme, there is a three-track design of clusters 
concerning the Stimulation that students receive in school. The first 
track in the design is Substance. Its levels correspond to nine major 
subject matters in the curriculum of a high school. Larger schools 
usually offer more courses; e.g., they offer more advanced courses in 
subjects such as science and foreign language. Therefore, it is 
important in relating school size to student output to be able to con- 
sider the differential effect of program offerings— i.e., of Stimulation 
according to the various subject matters. The second track in the 
design is Structure, and its levels correspond to the node of 
stimulation— by primary social interaction, secondary social interaction, 
and by asocial means. The present theory places emphasis on the social 
differences between large and small schools. The social atmosphere in 
a classroom differs qualitatively according to school size. For example, 
the students and the teacher nay not know one another as well in a large 
school; in a small school there may be more school-wide association. In 
a homogeneously grouped classroom in a large school, there will be less 
variation in ability and attitude, causing less conflict than in a 
heterogeneous classroom of a small school. Therefore, levels of Structure 
are considered necessary for description of the essential differences 
between large and small schools in terms of how learning takes place. 

The third track is distinguished by mean level and variation of stimula- 
tion. This distinction is necessary since the amount of stimulation 
for a school is really a distribution of the amounts received by the 
individual students. In summary, the original clusters for describing 
Stimulation were necessary, and the lack of data adequately to measure 
them is a stated limitation. 

Satisfaction has clusters defined according to a two-track design 
in the original scheme. The levels of the first track are intellectual, 
sensory, and social satisfaction. These levels represent an attempt to 
define the emotional qualities which may be degraded in large and densely 
populated social cettings. In particular, it is suspected that large 
schools may cause students to feel alienated and to forget academic goals 
because of emotional problems. The distinction between the three kinds 
of satisfaction seems necessary in order to determine the dimensions 
of emotional attituue in large and small schools. The second track 
corresponds to mean level and variation in amount of Satisfaction. 

Again, the amount to be measured is the distribution of aoiounts for 
individual students. 

In terms of designing further research into the question of school 
size and tion-dollar costs and in terms of further defining the , 
insufficiency of the available data, it is useful to consider what kinds 
of indicators ideally should be available for satisfying the original 
scheme. A list of indicators is presented below: 

FI-Gl-H. Ilean Level of English Primary Social Stimulation 

1. mean number of hours students spend discussing literature 
in small groups 

2. homogeneity of student ability in English classes 



FI*G2-ll« Hean .Level o£ English Secondary Social Stimulation 

1. mean number of hours a student attends lectures about 
literature 

2. mean amount of friendship of students toward English 
teacher 

FI-G3-M, Mean Level of English Asocial Stimulation 

1. mean nuciber of hours a student spends reading assigned 
books 

2. mean amount of blackboard usage in English classes 

Maan Level Intellectual Satisfaction 

1. mean number of unassigned books a student reads 

2. proportion of students receiving honor awards 

H2-11. ilean ’Level Sensory Satisfaction 

1. mean lighting power of the classrooms 

2. proportion of students engaged in extracurricular athletics. 

H3-II. Mean Level Social Satisfaction 

1. mean number of hours students spend in private conversations 
in school 

2. number of school social events. 



Examples are given for mean-level primary, secondary, and asocial structure 
stimulation in English and for mean-level intellectual, sensory, and 
social satisfaction. Such data clearly require special data collection, 
since it would serve no auxlliasry practical function; such as school 
accounting, in a data bank. Yet, it would provide much more exact 
explanations of the differential effects of school size on student output. 

There is another problem in dealing with the Stimulation clusters 
as originally defined. The subject areas for Substance were e 3 q>licitly 
specified as referring to kinds of substance, not to administrative 
units such as classes. English Literature as taught in history classes 
should be tabulated as English stimulation. The amount of crossover In 
subject matter may be expected to vary according to size of school; in 
a large school, teachers i^y be more specialized in what they teach. 

This distinction can be lost in the data available in education data 
banks and the result is distention. 



Measuring Student Output 

The adequacy of the educational data bank for measuring the clusters 
defined for Student Output is mixed. Precise measurement of Cognitive 
Ability is possible; in fact, the Cognitive Ability clusters may be* 
subdivided. However, there are no raeasures of Psychomotor and Social 
Ability and there are not sufficient data to maintain the distinctive 
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clucters concerning aspects of Aspiration and Emotional Outlook* Because 
of the abundance of cognitive ability data and the paucity of other 
data 9 a revised scheme of Student Output clusters was designed. It is 
presented in the next section* Here, the implications of the abundance 
and paucity are discussed* 

Examples of ideal indicators for the clusters as originally designed 
are presented below* Only the mean-level clusters are given. 

Il-^* Mean-Level Cognitive Ability 

1* mean composite score on ability— achievement test 
2* proportion of students planning to attend college 

X2<^« Mean-Level Fsychomotor Ability 

1* mean score of students on a test of physical fitness 
2* proportion of students regularly participating in sports 

I3-H* Mean-Level Social Ability 

1* mean ntimber of friends of a student 
2* proportion of students attending social events 

Jl-H* Mean-Level Vocational Aspiration 

1* mean prestige score of vocational aspiration 

2* mean discrepancy between student *s occupational aspiration 
and parent's occupation 

J2-M. Mean-Level Avocational Aspiration 

1* mean number of hours spent per week on a hobby 
2* mean number of hours spent per week watching television 

J3-M. Mean-Level Social Aspiration 

1* proportion of students aspiring to a social occupation 

2* proportion of students planning to get married soon 

Kl-M* Mean-Level Autonomy 

1* proportion of students who are certain of their plans 
2* proportion of students with paying jobs, 

K2-M* Mean-Level Distraction 

1* correlation between mark-point average and composite test score 
2* average ntimber of days absent 

K3-1I* Mean-Level Social Outlook 

1, proportion of students v/ith girl (boy) friends 

proportion of students who report that they are generally happy* 
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The variety of cognitive ability data is due to the coding in the 
data banlc of all subscales of the Iowa Test of Educational Development 
for all seniors* In redefining the Cognitive Ability clusters, various 
groups of subscales are clustered. This operation was motivated by 
recent work by Bock and Uiley (personal communication) vhich suggests, 
on the basis of a study in the Chicago area, that schools produce 
characteristic patterns of subscale xesponses on ability and/or achieve* 
ment tests* In terms of the present research, it is very relevant to 
ask whether large or small schools produce students with distinctive 
patterns on test subscales**i.e., is there a differential effect of school 
size on verbal aud quantitative scales? 

On the other hand, the paucity of data on the other facets of output 
impose a limit on the detail in which the effect of school size can be 
specified. The emotional makeup of a student, his plans, his interests, 
his health, and his social qualities are admitted targets of secondary 
education. In its most obvious form, this makeup is manifested in the 
teaching and grading of “citizenship". A large part of a school's 
efforts are aimed at these aon*academic concerns. Yet, while for academic 
areas, there is an elaborate attempt to evaluate both the syst^ and the 
individual, there is essentially no recorded information for evaluating 
the non*academic areas. To obtain such data would, of course, be 
technically difficult**and perhaps not entirely desirable in terms of 
long*range standards and goals of society**but its absence certainly 
makes difficult the evaluation of school systems and of school size. 



C* Clustering of the Available Data 



In Section A of this chapter, a theoretical framework was presented 
for studyi'ig the relationships between high school size and educational 
productivity. In Section B, notes were presented on what kinds of data 
ideally would be used to perform analysis within that framework and 
within the limitations of the data available from the Iowa and U.S. 

Census Bureau Data Banks. It was not expected that the available data 
would satisfy a complex a priori theoretical scheme. Yet, it was 
considered essential to develop the scheme and to work as far as possible 
within it. This process was described in Copter 2. 

This section contains descriptions of clusters of indicators 
actually determined and computed from the data samples. As explained 
in the previous section, certain general revisions of the original 
clusters had to be made. In addition to these general revisions**in 
School Ilediation and Student Output**numerous minor revisions were 
made. The "input" of a cluster is defined according to its indicated 
composition. When indirect or approximate indicators were available 
for a cluster, the meaning of the cluster is changed. It should be 
emphasized that these changes do net represenc error in the sense of 
random measurement error, but ratter error in the sense of change in 
the face value or validity of tha clusters. 



The description is given in three parts, one for each of the 
superclu8ters*«Cociaunity Input, School lied iat ion, and Student Output. 

For each supercluster, the revisions necessitated by the availability 
or unavailability of data are presented. The indicators are then 
coiaputed and the revisions in their assignment given. One point should 
be made on the definition of variables. There are three types of school 
organizations represented in Iowa: six? four? and three-year hi^ schools. 
To make the data compatible, usually only data concerning students in 
grades 10-12 inclusive were used. 



Coomunity Input 

As noted in the previous section of this chapter, there were 
sufficient data in the Census Bureau data bank to compute indicators for 
all of the pre-defined ccKamunity input clusters. It was not necessary, 
therefore, to make changes in the theoretical scheme. It should be 
reiterated that there are several shortcomings in the use of census data. 
First, because of the history and nature of the census, the data necessary 
for constructing ideal indicators were not available; i.e., the kinds of 
cocamunity social processes idiich are considered important for the measure- 
ment of the effect of school size on educational productivity could not 
be directly indicated, but had to be inferred at a certain loss of 
accuracy. Second, the census data are old with respect to the school 
data used. The changes which may have occurred in coomunities certainly 
would be crucial to the question of school size. The fact that a comminity 
has changed recently would have ramifications for the social processes 
operating in the comounity and for the consequent social characteristics 
of the students and the success of their performance in large or small 
schools. 

Population Dynam-lgg, The Population Dynamics clusters consist of 
the following indicators: 

A1 . Growth 

Al.l Birth rate: persons less than one year old per lOOC 

population 

A1.2 Age of residence: percent of residential structure 

built after 1950 

A1.3 Inmigration: percent of population living outside the 

county in 1955 

A2. Youth 

A2.1 Infants: percent of population less than 6 years old 

A2.2 Children: percent of population 5-19 years old 

A2.3 Young males: percent of population males 20-29 years Old 

A2.4 Young females: percent of population females 20-29 years 

old 
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A3« Density 

A3.1 Structure density: housing units per residential structure 

A3.2 Unit density: perscms per housing unit 

A3.3 Room density: persons per room in housing units. 

A hi^ aiaount of growth is indicated by 1) a high birth rate, 2) new 
homes » and 3) a hi^ rate of inmigration. These are indirect measure- 
Tients, since comparative figures from earlier years could not be obtained* 
Youth has four indicators corresponding to four segments of that popula- 
tion: 1) infants, 2) children and teenagers, 3) young adult men, and 

4) young adult women. The differentiation between young adult men and 
women was made because of the possibility of different rates of post- 
school migration for young men and women. The three indicators provided 
for density are measures of housing density in three respects: 

1) multiple-unit structures, 2) crowded units in terms of the nuad>er of 
people residing in them, and 3) crowded units in terms of the number of 
rooms* Since the geographic areas of the cocmunities were not obtained, 
no indicator directly measturing land crowding was constructed* 

Ponulation Characteristics . The Population Characteristics clusters 
consist of the following indicators: 

B1 . Rural /Urban 

Bl.l Urban Residence: percent of population with urban 

residence 

B1.2 Rural Non-Farm Residence: percent of population with 

rural non-farm residence 

B1.3 Agriculture Employment: percent of labor force eiaployed 

in agriculture 

B2- Socioeconomic Level 



B2.1 Income Level: mean of family income, measured on a 

13-point scale from 1 = $0-999 to 13 * $25,000+ 

B2.2 Education Level: mean of education for persons 25 

years and older, measured on an 8-point scale from 1 ^ 
no schooling to 8 » 4 or more years of college 

B2.3 Occupation Level: mean of occupation for eo^loyed 

males, measured on an 11-point scale from 1 - laborer 
to 11 = professional 
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B3* Socioeconouic Variation 



33.1 



B3.2 



B3.3 



IncoDe Variatioa: variance of family 

a 13-ooint scale from 1 « $0-SS9 to 13 » $25,000”^ 



Education Variation: variance of education for persons 

25 years and older » measured on an 8-point scale from 

I « no schooling to 8 = 4 or more years of college 

Occupation Variation: variance of occupation for employed 

nuileSf sieflsured on An ll**point scnlo froca 1 a rer o 

II = professional. 



The rural /urban character of a cocacainity is indicated by 1) 
of the population having urban residence, 2) the proportion of e pop - 
latioQ having rural non-farm residence, and 3) the proportiOT ® ® - 

force employed in agriculture. The Census Bureau defines three typM 
residence: urban, rural non-farm, and faro. The rural non- arm ^ 

ponds to rural homes not on farms and to residences to haane s. 
socioeconomic situattoii in a coomunity is represented by the s a 
trilogy of 1) tocooe, 2) education, and 3) occupation. Be^se 
cofasninity situation is construed to be a distribution of the soc oe 
levels of the members of the coinciunity, there is a pair of 
describes the mean-level status and one describes the variation ® 

The income data are for families. The education data 
years and older who usually have completed their formal education, 
occupation data are arranged according to an approximate prestige rating 
of the census, ranging from laborer to professional. 

Socialisation Characteristics. The Socialization Characteristics 
clusters consist of the following indicators: 



Cl. Social Activity/Involveiaent 

Cl.l Social workers: social, welfare, and recreation 

workers per lOCC population 

Cl. 2 Public employees: percent of labor force en^ployed by 

the public 

Cl. 3 Female employees: percent of adult females who are 

employed 

C2. Social liobilitv 

C2.1 Population heterogeneity: percent of populac..on non- 

xdiite, foreign bom, or of foreign vtock 

C2.2 Population mobility: percent of population living in a 

different house in the same county in 1955 



- 105 - 



C3. Poverty Culture 



C3«l Low incoue: porcent of fanillies with Incoioo less then 

$2,j00 

C3.2 Poor housing: percent of residential housing units 

idiich are deteriorating, dilapidated, or which lack 
any facility 

C3.3 Unenployoient : percent of able oales idio are unemployed. 

As stated above, the Census Bureau does not collect data idiich are 
descriptive of the social processes of a community. Therefore, the 
indicators for social activity/ involvement are derivatives. Three 
indicators were computed: the first is an index of the number of per- 

8CKI8 in the community iriiose occupations indicate direct social activity; 
the second, proportion of the labor force employed by the public may be 
related to the extent of community responsibilities; the third, proportion 
of adult females who have jobs, may be related to the social activity of 
women in the community. The assumption is that women who work outside 
the home have more opportunity for social contact. The indicators for 
the social taobility cluster are also derivatives. A high heterogeneity 
of the population may Indicate the presence of social change through 
assimilation or conflict; the housing nobility of the population may 
indicate changing socioeconomic situations. The indicators chosen for 
the poverty*culture cluster are versions of three standard evidences of 
poverty: 1) low income, 2) poor housing, and 3) unemployment. 

Economic Structure . The Economic Structure clusters consist of 
the following indicators: 

Dl. tfealth 

Dl.l Property value: mean residential property value, 

measured on a 10»point scale from 1 ® $0**4,999 to 
10 * $35,000+ 

D1.2 Rent level: mean gross rent, measured on a 13-point 

scale from 1 = $0-19 to 13 * $200+ 

D1.3 Automobile availability: number of automobiles available 

per housing unit 

D2. Industry 

D2.1 Industry employment: percent of labor force employed in 

Industry 

D2.2 ilanagerial employment: percent of labor force employed 

as managers 

D2.3 Clerical employment: percent of labor force eeployed 

as clerks 
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D3, Trade 



D3,l Retail ecployment: percent of labor force employed in 

retail establishments 

D3.2 Service employment: percent of labor force employed in 

personal and business service 

D3.3 Sales employment: percent of labor force employed as 

sales workers 

D4. Farming 

D4.1 Family fare help: unpaid family farm workers per farmer 

D/^.2 Paid farm help: paid farm laborers per farmer 

D4.3 Food industry: percent of labor force employed in the 

food industry. 

The indicators for the wealth clusters were aimed at measuring personal 
wealth in terms of: 1) residential property value* 2) rent level, and 

3) availability of automobiles. The wealth measures were related to 
personal wealth rather than corporate wealth, since census data are 
collected for individuals. The industry cluster Is indicated by 

1) employment in Industry, 2) employment in managerial positions, and 
3) employment in clerical positions. The presence of a business 
bureaucracy is included in this cluster by distinguishing managerial 
and clerical positions. The trade cluster is indicated by 1) retail, 

2) service, and 3) sales employment levels. Business and personal ser- 
vices are combined in this cluster, althou^ they are separate census 
classifications. The indicators for the farming cluster are 1) number 
of family farm workers, 2) number of farm laborers, and 3) percent of 
labor force in the food industries. They are intended to index the 
faming economy. 

Educational Resources . The Educational Resources clusters consist 
of the following indicators: 

El . Archive Usage 



El.l Librarians: librarians per lOOC population 

E2. Educational Involveuent 

E2.1 School enrollment: percent of population enrolled in 

school 

E2.2 School employment: percent of labor force employed in 

educational services 

E3. Educational Attainment 

E3.1 College attainment: percent of persons 25 years and 

older with some college education 

S3. 2 Professional iployment : percent of labor force employed 

in professional occupations. 
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Only one Indicator could I>e found in the census data to index the 
aoount and nature of usage of archives in couinunities» and that was the 
nuuber of librarians per 1000 population. The educational involvenent 
cluster has as indicators I) the percent of the population enrolled in 
school, which reveals the degree to which fanilies in the community are 
directly connected .with education, and 2) the proportion of the labor 
force enployed in educational service. No further indicators could be 
found idiich would show the amount of informal involvement of the community 
in its education processes. The educational attainment cluster has two 
indicators: I) the amount of college education of adults in the community, 

and 2) the amount of professional employment in the community. As noted 
earlier, the presence of such persons may be expected to stimulate inter- 
est in academic pursuit. 



School Mediation 



The Iowa educational data bank does not provide adequate indicators 
for the theoretically defined clusters of the School Mediation super- 
cluster. A revised and simplified theory was therefore developed for 
the purpose of data analysis. The revised system is diagrammed in 
Figure 5-C-l; it was derived more from an ^ hoc consideration of what 
data were available than from a re-examination of the theory. The three 
major areas— stimulation substance, stimulation structure, and satisfac- 
tion— are preserved in the revised scheme, but the amount of detail 
within them and the amount of interaction among them is simplified. The 
substitution of the revised scheme for the original involves a reduction 
in descriptive complexity and of the precision with which the variation 
in student achievement can be assigned to specific processes and quali- 
ties in the schools. 

In the revised scheme, substance and structure have been separated 
in the stimulation clusters. In the original scheme, the two aspects 
were crossed: the substance of the stimulation was differentiated 

according to the structure of the stimulation. In the revised theory, 
there are levels of stimulation and levels of structure, but there is 
not the interaction. Moreover, the number of levels under substance 
has been reduced to five— only the basic academic subjects are 
considered— and the differentiation between mean-level and variation 
effects of substance has been omitted. The single cluster for each of 
the five levels under substance represents the general aiDount, quality, 
and reception of stimulation. The stimulation structure clusters have 
been reduced to two pairs: mean-level and variation in teacher experience 

and class heterogeneity. The teacher experience cluster is considered to 
approxiihatc ' the amount of interaction taking place between teacher 
and student, and the class heterogeneity cluster is considered approximate 
to the amount of interaction taking place among students in the class. 

The satisfaction construct is reduced in the revised scheme to two 
cluster pairs: mean-level and variation in attitude toward study and 

social interaction. By attitude toward study is meant a student’s 
willingness and desire to learn. By social interaction is meant the 
quality of interaction a student has with teachers and with peers. 
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Both these clusters ere considered to relate to the aiaount of satisfaction 
(as defined in the original scheme) a student obtains from school. Be- 
cause they are measures of individual qualities, both mean-level and 
variation clusters are defined. 

Stimulation Substance. The stimulation substance clusters con- 
sist of the following indicators: 

FI. Mathematics 

Fl.l Advanced mathematics: percent of students taking 

an advanced mathematics course 

FI. 2 Mathematics consistency: correlation coefficient 

between ratheciatics marks and MPA's 

FI. 3 liathemstics/like : percent of students who like 

mathematics best 

FI. 4 llathematics/dislike: percent of students who like 

mathematics least 

F2. Science 

F2.1 Advanced science: percent of students taking an 

advanced science course 

F2.2 Science consistency: correlation coefficient between 

science marks and MPA's 

2.3 Science/like: percent of students who like science 

best 

F2.4 Science/dislike: percent of students who like science 

least 

F3. English 

F3.1 Advanced English: percent of students taking an 

advanced English course 

F3.2 English consistency: correlation coefficient between 

English m::rks and MPA's 

F3.3 English/like: percent of students «dio like English 

best 

P3.4 English/dislike: percent of students who like 

English least 

F4. History 

F4.1 Advanced history; percent of students taking an 
advanced history course 

F4.2 History consistency: correlrt-on coefficient between 

history marks and ilPA's 
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F4.3 History/like: percent of students vho like history 

best 

F4.4 History /dislike: percent of students who like history 

least. 

F5. Foreign Language 

F5.1 Foreign language: percent of students fdio are taking 

a foreign language course 

F5.2 Foreign language consistency: correlation coefficient 

between foreign language marks and MPA 's 

F5.3 Foreign language /I ike: percent of students «dio like 

foreign language best 

F5.4 Foreign language /dislikes percent of students who 
like foreign language least. 

All indicators are based on students in grades 10«12 inclusive 
only. For each subject matter area, advanced courses are defined as 
those which are at a higher than required level, such as second year 
algebra, or idiich have nonstandard content, such as astronomy. The 
correlation coefficients are based on all course grades for each subject 
matter area. The like and dislike percents are based on responses to 
two questions concerning which subject the student liked best and 
which he liked least: there are five alternatives other than these five 
subjects. 

Similar indicators are defined for each subject "-mathematics, 
science, English, history (social studies), and foreign language. The 
first indicator for each subject is the percent of students in grades 
10-12 who are taking an advanced course in the subject. This indexes 
the amount of information in the subject vdiich is actually being pre- 
sented by the school. The second indicator is a measure of consistency. 
Considering as one observation each student's enrollment in each course 
in the subject, the consistency measure is computed as the product- 
moment correlation coefficient between the mark received by the students 
and their overall mark-point averages (MPA's), A high coefficient 
indicates that students are not performing erratically in the subject. 

The third indicator is the percent of students who respond that the 
subject is the one they like best; it indexes interest and successful 
stimulation in the area. The fourth indicator is the percent of Students 
who indicate that the subject is the one that they like least. 

Stimulation Structure . Stimulation Structure clusters consist of % 
the following indicators; 

Gl-M. Meen Teacher Experience 

Gl-M.l Teachers/age: mean of the ages of the teachers 

Gl-M. 2 Teacher experience: laean of the number of years of 

local experience of the teachers 

Gl-M, 3 Teacher college credit: mean of the number of college 

credits of the teachers. 
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G1«V. Variations of Teacher Experience 



Gl-V.l Teacher age: varr^ance of the ac^s of the teachers 

G1-V.2 Teacher experience variation: variance of the numbers 

of years of local experience of the teachers 

G1-V.3 Teacher college Credit variation: variance of the 

numbers of * ours of college credit of the teachers. 

G2-M. Mean Class Heterogeneity 

G2-M.] Class size: mean number of students in the classes 

G2-M.2 Class liPA heterogeneity: mean over classes of the 

variance of the students' KPA's 

G2-M.3 Class homework heterogeneity: mean over classes of 

the variance of the students' amounts of homework, 
determined according to a six-level response coded from 
1 ^ none to 6 ^ 21 hours or more 

G2-M.4 Class expectations heterogeneity: mean over classes of 

the variance of the students' educational expectations, 
determined according to a ten-level response coded from 
1 * less than high school completion to 10 ■ graduate 
study. 

G2^V. Variation of Class Heterogeneity 

G2-V.1 Class size variation: variance of the number of 

students in the classes 

G2-V.2 Class MPA heterogeneity variation: variance over 

classes of the variance of the students' HPA's 

G2-V 3 Class homework heterogeneity variation: variance over 

classes of the variance of the students' amounts of 
homework, measured as G2-M.3 above 

G2-V.4 Class expectation heterogeneity variation: variance over 

classes of the variance of the students' educational 
expectations, measured as G2-M.4 above. 

The class heterogeneity indicators are based on all classrooms 
containing at least one student from grades 10-12 inclusive and having 
an academic subject. For each such class, four measures of heterogeneity 
were considered: the number of pupils, the variance of the MPA's of 

the pupils, the variance of the amount of time spent by the pupils on 
homework, and the variance of the educational expectations of the 
students. The means over classes of these measures form indicators for 
cluster G2-H and the variances over classes form Indicators for cluster 
G2-V. 



Three measures are used as teacher experience indicators: 1) ege; 

2) nun^er of years of local teaching experience; caid 3) number of college 
credits. The means and the variances of the measures are defined and 




included in the mean**level and variation teacher experience clusters* 

Four measures are defined for each class in a school (classes without 
any students in grades 10»12 inclusive omitted); 1) number of pupils, 

2) the variance of the HPA's of the students, 3) the variance of the 
amounts of time spent on homework by the students, and 4) the variance 
of the educational expectations of the students* These are all con- 
sidered to relate to jlass heterogeneity; a large class may result in 
a heterogeneous set of behaviors. The other measures are direct 
measures of heterogeneity of student characteristics* The mean-level 
cluster for class heterogeneity has the means over classes of these 
measures as Indicators, and the variation cluster for class heterogeneity 
has the variances over classes as indicators* 

Satisfaction . The satisfaction cluaters consist of the following 
indicators: 

Hl-M* Mean Attitude Toward Study 

Hi -14.1 Student attitude: mean response concerning personal 

like to study, based on a six point scale from 1 ■ dis- 
like vary much to 6 * really like it 

Hl-M* 2 Homework: mean amount of time spent doing homework, 

based on a six point scale from 1 * none to 6 ■ 21 hours 
a week or more 

Hl-M*3 Parent attitude: mean report of parents' feelings about 

the amount of homework assigned, based on a four point 
scale from 1 ** indifferent to 4 ■ too little* 

Hl-V* Variation of Attitude Toward Study 

Hl-V*l Student ittitude variation: variance of response con- 

cerning liking study, based on a six point scale from 
1 ®* dislike very much to 6 * really like it 

Hl-V* 2 Homework variation: variance of amount of time spent 

doing homework, based on a six point scale from 1 ~ none 
to 6 21 hours a week or more 

Hl-V*3 Parent attitude variation: variance of reports of 

parents' feelings about amount of hcanework assigned, 
based on a four point scale from 1 * indifferent to 4 * 
too little 

H? -M. Maan Social Interaction 

H2-M*l Unpleasant experiences; mean frequency of unpleasant 
experiences reported with other students, based on a 
five point scale from 1 » none to 5 * a great many 

H2-M.2 Teacher attitude: mean perception of teacher's view of 

student, based on a five point scale from 1 * very poor 
to 5 « top 

H2-Ii*3 Activities; mean number of activities in which a student 
is engaged, based on the number of “very" or "fairly" res- 
ponses ooncarning activity ’.‘.n ten extracurricular activi- 
ties, Including athletics, student government, social 
affairs, etc. 
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H2«»V. VariJition of Social Interaction 



H2-V.1 Unpleasant eKperiences variation: variance of the 

frequency of unpleasant experience reported with other 
students, based on a five point scale from 1 “ none to 
5 ■ a great nany 

H2-V.2 Teacher attitude variaticm: mean perception of teacher's 

view of student, based on a five point scale from 1 “ 
very poor to 5 * top 

H2*V-3 Activities variation: variance of number of activities 

in which a student is engaged, based on the number of 
'Very” or "fairly" responses concerning activity in ten 
extracurricular activities, including athletics, student 
government, social affairs, etc. 

The attitude«toward«6tudy indicators are based on the responses by the 
students in grades 10-12 to questions concerning: ly how much th^ like 

to study, 2) how much time each week they spend on homework, and 3) how 
their parents feel about the amount of homework that is assigned. The 
first item elicits directly an attitude toward studying. The second is 
less direct; students who study much usually like to study. The third 
item was included here as an added indication of. student attitude toward 
studying. Because the items were measures of individuals, their means 
and variances were computed and assigned to a mean-level and a variation 
cluster. The social interaction indicators are based on responses by the 
students to items concerning: 1) how frequent have 'been unpleasant exper- 

iences with other students, 2) idiat are teacher's atttitudes toward the 
student, and 3) in what extracurricular activities does the student take 
part. The first and third Measures directly refer to social Interaction 
with other students. The second measure is assumed to be related to the 
amount of rapport— or social interaction— with teachers. Again, mean- 
level and variation clusters are defined. 



Student Output 

The Iowa educational data bank information is insufficient to measure 
some of the student output clusters, but there was an unexpectedly rich 
supply of data in the area of cognitive ability and that cluster was 
expanded. This expansion necessitated a revision of the theoretical 
scheme for student output. The revised scheme is diagrammed in Figure 
5-C-2. The original three ability clusters have been replaced by three 
cognitive ability clusters. These three clusters correspond to areas 
covered in the Iowa Test of Educational Development and will not be 
described further. Also, the aspiration and emotional outlook clusters 
had to be simplified. One pair of clusters for each replaced the original 
three pairs for each. The aspiration pair consists of a mean-level and 
variation cluster for general socioeconomic aspiration relating to edu- 
cational and occupational ^als. The emotional outlook pair is a mean 
level and variation cluster for general adjustment. Because all these 
clusters are intended to represent the output of the school system, only 
twelfth grade students were used in computing the indicators. 
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Ability a The ability clusters consist of the following indicators: 
I1«H« Hean Verbal Ability 

Il-II.l Eispression: mean score on ITED-3, correctness and 
appropriateness of expression 

11*41 .2 Vocabulary: mean score on ITED-8, general vocabulary 
I1«M.3 Literature: mean score on ITED-7, literature interpretatioii 

11 - V. Variation of Verbal Ability 

Il-V.l Expression variation: variance of ssores on ITED«3, 

correctness and appropriateness of expression 

I1-V.2 Vocabulary variation: variance of scores pn lTED-8> 

general vocabulary 

11- V.3 Literature variation: variance of scores on ITBD-7» 

literature interpretation 

12 - M, Mean Science and Quantification Ability 

12- ii.l Science Sackground: mean score on ITED<-2, general back- 

ground in the natural sciences 

I2<41.2 Quantification: mean score on ITCD-4, ability to do 
quantitative thinking 

I2-11.3 Science Interpretation: mean score on ITED-6, inter- 

pretati<m in natural science 

12- V> Variation of Science and Quantification Ability 

I2-V.1 Science background variation: variance in scores on ITED-2 , 

general baclcground in the natiiral sciences 

I2-V.2 Quantification background variation: variance of scores 

on ITED-4, ability to do quantitative thinking 

12- V.3 Science interpretation variation: variance in scores on 

ITED-6, interpretation in natural science. 

13- ii. Kean Social Science Ability 

13- I1.1 Social concepts: mean score on ITED-1, understanding 

of basic social concepts 

I3-H.2 Social studies interpretation: mean score on ITED-5 , 

interpretation in social studies 

I3*4i.3 Infomation source utilisation: tsean score on ITED-5, 

use of sources of information. 
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I3-V. Variation of Social Science Abllit 



I3-V.1 Social concepts variation: variance of scores on ITED-1, 

understanding of basic social concepts 

I3-V.2 Social studies variation: variance of scores on ITBD-5, 

interpretation in social . tudies 

I3-Y-3 Information source utilization variation: variance in 

scores on ITED-9, use of sources of information, 

Tiie nine subtests on the Iowa Test of Educational Development were 
titioned into three groups: 1) verbal ability, 2) science and quanti i* 

cation ability, and 3) social science ability. Each group consists of 
three tests. For each group, a pair of clusters is defined; one cluster 
for means and one for variances. 

Aspiration . The aspiration clusters consist of the following 
indicators: 

Jl-ll. Mean Aspiration 

Jl-14.1 Educational expectation: mean education level the 

students expect to attain, based on a 10 point scale 
from 1 * less than high school graduation to 10 * graduate 

study 

J1-M.2 Educational discrepancy: mean discrepancy between the 

education levels expected by the students and those 
achieved by their fathers 

J1-H.3 Girls* aspirations: mean occupational aspiration of 

girls, based on a five point scale from 1 ** full"time 
ho me maker to 5 * fuU^iiae career. 

Jl-V. Variation in Aspiration 

Jl-V.l Educational expectation variation: variance of the 

education levels the students expect to attain, based 
on a 10 point scale from 1 less than high school 
graduation to 10 - graduate study 

J1-V.2 Educational discrepancy variation: variance cf the 

discrepancies betxreen the educational levels expected 
by the students and those achieved by their fathers 

J1-V.3 Girls' aspirations variation: variance of occupational 

aspirations of girls, based on a five point scale from 
1 * full-time hoTAeoaker to 5 * full-time career. 

For each student in twelfth grade, three measures are computed: 1) edu- 

cational expectation level, 2) discrepancies between expected educational 
level and educational level of the student’s father, and 3) occupation 
aspiration of girls. The educational expectation level generally indicates 
the socioeconomic aspiration of the student. The discrepancy with the 
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father's educational level refines this by indicating the degree to 
which the expectation is upward •mobile. Occupational aspiration infor- 
mation was not available for boys, but for girls an item response 
indicated whether the girl aspired to a career, full-time or part-time, 
or to be a homemaker. The career aspiration of girls may be considered 
an indicator of the general socioeconomic aspiration of the students. 
Because these measures are individualistic, two clusters were formed, 
one for mean-level and one for variation. 

Emotional Outlook s The emotional outlook clusters consists of the 
following indicators: 

Kl-M, Mean Adjustment 

Kl-ii.l l!arkpoint variance: mean student markpoint variance, 

computed as the varicncr- of the marks that a student 
receives in academic courses 

K141.2 Aspirational discrepancy: mean difference between 

what educational levels students believe they could 
attain and what levels they expect to attain, based on 
a 10 point scale from 1 less than high school gradua- 
tion to 10 s graduate study 

Kl'^.S Uncertainty: percent of students idio are uncertain of 

their post-high school educational plans 

Kl-ti.4 Capacity learning: correlation coefficient between 

MPA's and conq;>o8ite ITED scores 

Kl-V. Variation in Adjustment 

Kl-V.l liirkpoint variance variation: variance of student 

markpoint variances, computed as the variance of the 
loarks that a student receives in academic courses 

Kl-V. 2 Aspirational discrepancy variation: variances of 

differences between the educational levels that students 
believe they could attain and what levels they expect to 
attain, based on a 10 point scale from 1 * less than 
high school graduation to 10 » graduate study. 

For each student in the twelfth grade, a special mark summary coefficient 
was formed— the markpoint variance. This was computed as the variance 
of the marks received by the student in academic courses, and It is 
considered to be a measure of consistency of performance. It is assumed 
that consistency is an aspect and indicator of adjustment. Another 
indicator was computed from tvTO parallel questions asked of the students: 
what educational level they expected to attain, and what level they 
thought they could attain. The discrepancy between the responses is also 
considered a measure of adjustment. The means and variances of these 
two measures were computed over students and assigned respectively to the 
aean-level and variation adjustment clusters. TWo additional measures 
were computed to Indicate mean-level of adjustment. The first is a 



-119 



measure o£ uncertainty; i.e*, percent of students who were uncertain of 
their future educational plans. The second is a measure of the dis» 
crepancy between test score and ilPA: specifically, it is the correla- 

tion coefficient between the two* \hen there is a high correlation, 
then it can be said that students are working to capacity. A lower 
coefficient may Indicate emotional distraction or interference. 



D. Analytic Techniques 



This section contains the theoretical bases, statistical proper- 
ties, computat ion.3 1 formulas, and interpretative implications of the 
analyses performed on the data. The indicator variables were extracted 
from the Iowa and U.S. Census data banks by a series of computer pro- 
grams, which are briefly described below, and were fed through a four- 
stage c^puting procedure before the final results were obtained. The 
first stage— filtering— has not been discussed previously, since its 
need became apparent only during the final stage of the project. The 
second stage involved compositing the indicators by clusters according 
to priiicipal component analysis. The third stage involved factoring 
the clusters within each supercluster— Community Input, School Media- 
tion, and Student Output— according to image analysis. The fourth 
stage involved regression and graphical analysis of the interrelation- 
ships of the image factors. 

The principal ccanponent compositing, the image analyses, and 
some of the stage four analyses were first performed on indicator 
variables as calculated from essentially the entire population of Iowa 
secondary schools. It was apparent that there were complicating data 
in the indicator data matrix. In particular, image factor scores, 
which automatically have means of zero and standard deviations of one, 
were found for some schools to exceed plus or minus ten. This indicated 
enormously skewed distributions^ even though after the compositing 
operation one would have expected approximately normal distributions. 
There were two possible reasons for the skewness; either there vere 
schools which were most \anusual in terms of their educational situations 
or qualities, or there were erroneous data mixed in with good data. 
Consequently, a manual examination was made of the data for schools 
with extreme factor scores. (Careful monitoring of all the input data 
was not possible because of the vast amount of data processed and 
because of the desire to establish the probability of just these kinds 
of problems.) It was found that the extreuie values generally corres- 
ponded tc missing or inconsistently coded data and not to unusual data. 
Some procedure had to be developed for omitting such data or for pro- 
viding suitable substitutes. 

Except in a few isolated cases, in which census data were in- 
correct because of manual errors in the mapping operation, all the 
errors were found in the Iowa educational data bank data. It should 
be stated that the errors encountered are not those which would 
significantly affect the utility or vaJ.ue of the Iowa educational data 






bank. fact, the perronnel afc the data bank operate viith the expecta- 
tion that data will occasionally be missing or incorrect. The amount 
of time spent correcting data is proportional to the necessity that the 
data be correct. For example, it is necessary that each teacher's 
s<jicial security number be recorded correctly. If, however, a teacher’s 
current assignment is incorrect or missing, no irreparable damage is 
done. The objective of data processing systems is service. If a school 
fails to convert its grading codes to the statewide code, thr most that 
can happen in terms of service is that the school will not receive 
computed grade points. The data received from the Iowa tank is certain^ 
sufficiently accurate and complete for administrative purposes. A question 
must be raised as to whether it is satisfactory for research problems 
such as the one reported herein. 

Twc problems were envisioned had the erroneous data been retained 
and ignored. First, they generally depress correlations. Since from 
preliminary analysis it was known that the correlations would be low, 
this was not acceptable. Second, graphical presentations were planned. 
Therefore, such data were not acceptable because the scaling of the 
graphs would have been overextended. Third, such data make inappropriate 
the definition and computation of orthogonal factor scores. This last 
point is not widely known, but may be illustrated by an example. Given 
two orthogonal components derived from a set of variables by the method 
of principal compoiients, one might expect the variates to be bivariate 
normally distributed, with correlation zero. Yet, all orthogonalization 
guarantees is that the linear correlation is zero, while the bivariate 
distributions may consist of the mixture of a bivariate normal distribu- 
tion, with high correlation, and one extreme outlier placed in such a 
way as to negate the correlation of the other points. 

The problems envisioned were considered serious enough so that 
extreme measures were employed to clean the data. The operation is 
called filtering and is described later in this section. It consisted 
of omitting all schools which were in the extreme tails of the distribu- 
tions one or more variables. This operation might not have been 
necessary if different kinds of analysis had been employed. In this 
research, the school is the unit under study and the sample size is 
actually qaite raall, being in the hundreds. If students were the units 
of study, such cata would lu;ve been less serious, since students numbered 
in the hundred thousands. The particular analytic techniques used here 
were especially sensitive to erroneous data. Yet, it must be concluded 
that the data should probably be considered inadequate for the purposes 
of the present reseai'ch. By the end of the filtering operation, about 
half the schools had been omitted from the analysis. The investigators 
cannot answer the crucially important question of whether schools having 
such data correlate with the high school size/non-doUar costs relation- 
ship; i.e., do the schools omitted fr<xn consideration form a special 
sub -population of schools in terms of their educational or environmental 
characteristics? Does failure to supply data indicate special qualities 
which are related to how schools are operated and, in turn, to how 
students are taught and learn? 
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CoRslmction of the Indicators 



Prior to performing any of the analyses, it was necess^y to ri^ 
a series of computer programs which extracted the indicator ^formation 
from the data bank tapes. In the following paragraphs are given brief 
descriptions of the programs and their purposes. All the programs were 
written in a combination Fortran-63 and Compass machine language for the 
CDC 3600. 

Census data processing . Three programs ran sequentially to produce 
the indicators use in Community Input*. The first program read the census 
data tapes and the mapping cards. The contents of the mapping cards 
are described in Chapter 4; essentially, there was one mapping card for 
each census unit, giving the proportional assignment of the unit to 
school areas. The first program collated the two files. It produced 
an output tape file which consisted of a copy of the census data plus 
tags indicating for each unit which schools with which proportions were 
assigned to the unit. The second program then made repeated passes 
through the new tape file. There was one pass for each successive group 
of 28 schools— this limit was imposed by computer memory size. The 
values of the census items for the 28 schools were accumulated internally 
during a pass. As each tagged data set was read, its tags were examined 
to see whether or not the unit was assigned, to any extent, to any of 
the 28 schools; if it was, its vauLues were added to the accumulated s\ims 
for the school after multiplying by the indicated proportions. The out- 
put tape file of the second program served as input to the third program, 
which formed the ratios, percentages, etc., needed as the Community 
Input indicators. 

Student data processing . The student data were recorded on a 13- 
tape file* One computer program was used to process the information and 
to produce most of the indicators for School Mediation and Student Out 
put. The program was run 13 times, once for each tape in the file. 

The program read the student records school by school. For each school 
it determined the averages, proportions and variances required for con- 
struction of indicators* In addition » it reconstructed the assignment 
of students to classrooms on the basis of student course records , and 
determined the averages and variances jf classroom qualities. For 
each school, the program generated the constructed indicators. 

Teacher data processing . Two clusters of indicators in the School 
Mediation group required examinations of the Teacher data file. For 
this purpose a program was run which read the 2 -tape teacher file and 
computed the necessary means and variances by school. 



Filtering the Data 

The data processing operations described above yielded a data 
matrix on tape of about 120 indicator variables and about 400 schools. 
As noted above, there were many missing or miscoded data. A quick 
visual examination was first made of a printout of the data. This led 
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to the omission of about 70 schools. A preliminary set of principal 
component and image analyses was then made. At that tine, the extreme 
factor scores were detected during a second visual examination of a 
printout. A detailed examination was made of the data for which the 
largest extreme scores had been obtained. It was found that, in all 
cases, those schools had consistently miscoded or missing data. In 
one school, no seniors had filled out the general information question- 
naire. In another school, only half the students had grades reported. 

The large amount of missing and incorrect data in these schools suggested 
that scores other than the extreme ones noted might be spurious. The 
only possible solution was to omit those schools. 

Because of the amount, it was not possible to check all the source 
data manually. The existence of undetected erroneous data was con- 
sidered sufficiently certain that it was necessary to devise an automatic 
procedure for clearing the data matrix of schools with such data. The 
"filtering” operation consisted of a computer program which scanned 
the indicator data matrix and computed the means and standard deviations 
of each indicator variable. A second pass was then made through the 
matrix. This time, each variable for each school was compared to the 
mean and standard deviation of the variable for the population. If one 
or more of the values exceeded the mean by 3.5 or more standard devia- 
tions, the school was omitted. In all, llO schools were omitted, and 
the final sample size for analysis consisted of 220 schools. 

The 3.5 standard deviation criterion would correspond roughly to 
the omission of 57. of the schools in the case vdiere 120 independent 
normally-distributed variables are processed. Because of the non- 
normality and dependency of the data, roughly 30% of the schools were 
omitted. This must be considered scmiewhat detrimental in terms of the 
representativeness of the sample of schools. The only apparent alter- 
native was to have transformed the distributions of the variables; i.e., 
to have substituted standard values of 3.5 for standard values of more 
than 3.5. However, because of the correlated nature of the questionable 
data— e.g., if a school was missing a large number of questionnaire 
responses, many variables would be affected— substitution would probably 
have led to less obvious but just as spurious final results. 



Composition of the Clusters 

Given a set of variables, the principal component, as defined by 
Hotelling (1935), is a certain linear combination of the standardized 
versions of the variables. The weights in the combination are deter- 
mined by the largest eigenvalue . of the correlation matrix of the 
variables and by its corresponding unitized eigenvector. Specifically, 
the weight for the X^h standardized variable is equal to the l^th entry 
in the eigenvector divided by the square root of the eigenvalue. The 
correlations between the variables and the component score are similaxly 
definable. The correlation between the ij:h item and the score is equal 
to the JLth entry in the eigenvector multiplied by the square root of the 
eigenvalue. 



The principal component is the linear combination of the variables 
which represents the maximum amount of variance that can be extracted, 
by a single linear combination, from the variables. This can be under- 
stood best in the sense of correlation between the variables and the 
component score. With the principal component, the sum of the 
of these correlations is maximized. Wlien the eigenvalue on wliich the 
principal component is based is compared to the number of varia es, 
the result is the proportion of the total variance of the variables 
which is extracted by the principal component. 



When the indicators of a cluster have homogeneously high corre- 
lations, the proportion of variance extracted by a Pf 

analysis is high. For example, for the (hypothetical) correlation matrix 



1.0 .8 .8 

.8 1.0 .8 

.8 .8 1.0 



82.57. of the variance is extracted. If the correlations are homogeneous, 
but low, the principal component is still an efficient blend of the 
variables, though a smaller proportion of the variance is extracted. 

There are problems, however, if the correlations are heterogeneous. Two 
cases should be considered. First, there may be distinct structures 
in the correlation matrix, and the principal component may extract and 
represent only one of the structures. For example, the principal 
component of this matrix 



1.0 .8 .0 .0 

.8 1.0 .0 .0 

.0 .0 1.0 .6 

.0 .0 .6 1.0 



correlates only with the first two variables. Second, there may be 
systematically small correlations but with different signs, and the 
principal component may correlate positively with some variables and 
negatively with others. For example, for this correlation matrix 



1.00 .06 -.10 

,06 1.00 -.08 

-.10 -.08 1.00 



the principal component correlates positively with the first tw var- 
iables, but negatively with the last. This would be considered unde- 
sirable if the three variables were to be considered positive but 
independent aspects of some quality— a hypothesis not disproved by the 
signs because of the small correlacions. 
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Many instances of these effects are realized in the data analysis 
for this project ^ yet principal conponents analysis was the only 
available appropriate analysis. There is need, therefore, to promote 
the development of a new statistical technique. The problem is to 
devise an autooiatic wei^ting procedure for compositing heterogeneously 
correlated variables»-for determining a score which correlates highly 
and with specified directions with all the variables in the set. One 
criterion which mi^t be used may be called *' m a xi m in **. The criterion 
is that all variables should have the specified sign and the smallest 
correlation (in absolute value) should be maximized. A computational 
procedure for the maxim in cocuponent has not been determined, althou^ 
considerable time has been expended in trying to find one and in trying 
to identify the properties the maxiialn compooent would have. It is 
not clear whether a properly signed component always exists or whether 
the aui»iinin if unique. If, however, such a component technique can be 
found, it would be a useful addition to the indicator^cluster** factor 
methodology employed in this project. Perhaps for each cluster ^ the 
decision to use principal or maxim in component analysis would be nmde 
on the basis of the theory for clustering the indicators or on the 
basis of the determined correlation matrix* 

However, principal component analyses were used on the data for 
this project, and it will now be described how such analyses can be 
interpreted. First, the proportion of variance extracted is a measure 
of the hoiBogeneity and magnitude of the correlations# A hi^ proportion 
indicates that the principal component score is a good surrogate for 
the set of variables# In terms of predicting sooie outside variable, 
the principal cooq>onent will probably serve almost as well as the 
entire set of variables# On the other hand, a low proportion indicates 
that a considered amount of the predictive power of the indicators has 
been lost in the compositing. The correlations or loadings between 
tbe component and the variables indicate the degree to which the individ* 
ual variables a^a included in the ccmposite and the directional 
relationship that they have with the component. Certain variables nay 
have negative loadings and others may have essentially zero loadings. 

The pattern of correlations is examined for the purpose of reinter* 
preting the cluster; if the signs or naagnitude of the loadings are 
not as had been expected, the meaning of the cluster as me asu r ed may 
not be the same as the meaning originally intended. The name of the 
cluster may then have to be redefined. 



Image Analysis 

The term "factor analysis" refers to two related models: **pwre" 

factor analysis, in which there is one unique factor for variable 
as well as a set of common factors, and '‘component analysis", in which 
there are only common factors and they are combinations of the variables# 
With pure factor analysis (alpha and canonical factor analysis) a 
relatively reasonable model is achieved, but factor scores are not 
computable# With component analysis, a less reasonable model is 
achieved, but factor scores are computable. Because the factor scores 



were to be used In the stsge four computations of the project, it was 
necessary to use a component analysis model* The component analysis 
approach chosen was image analysis, which was originally fomulated by 
Guttman (1953) and later extc>''/)ed and improved by Harris (1962)* Image 
analysis is a component analysis technique which is intended to 
approximate pure factor analysis* 

In image aialysis, there is an isq^licit transformation of the 
variables* Each variable is transformed implicitly into its "image", 
which is defined as its regression estimate*«i*e* , the variable as 
predicted from the other variables* Also defined implicitly is the 
"anti-image" which is the residual in the regression* In image analysis, 
the factors are determined on the basis of the images rather than on thm 
basis of the original variables* Because of certain mathematical 
identities, it is not necessary esq^licitly to confute the image variables* 
All the computations can be carried out in terms of the original variables* 

Let Z be the p x N matrix of standard (means * 0« variances * 1) 
original variables* The correlation matrix is H * H*^ZZ^« From the 
correlation matrix, the image analysis may proceed* First, the diagonal 
matrix of anti-image variances is computed: » the diagonal matrix 

of the reciprocals of the diagonal entries of R*^* Note that a diagonal 
entry of is the complement of the squared-multiple-correlation between 
the corresponding variable and all the other variables* It would be 
possible at this point to compote and factor the image covariance isatrix: 

6 « R + S^R*^S^ - ^S^* However, according to the Harris scheme, a 
different matrix is factored: S*^RS**^ * idiere is the diagonal 

matrix of eigenvalues of and the planus of X are the correspond- 

ing unitized eigenvectors* Let Bg^ * (B^^ - I)2B]^*2* Then the Harris 
factors of 6 are Fg * ^^g* Actually only certain of the factors are 
retained. All factors corresponding to roots in B^^ greater than one 
are retained* These factors may then be rotated Kaiser's (1958) 
nomal varimax procedure and FgT is obtained* It can be shown that the 
image factor scores can be coopted from the original variables accord- 
ing to this formula: y « t'b-X'S"^ 2* These factor scores have means 

equal to zero, variances equal to one, and inter correlations equal to 
zero* 



Guttman (1953) originally showed that image analysis is an approxi- 
mation of pure factor analysis* Hhen a universe of variables is 
sufficiently sampled, the image factors approximate the common factors* 
Guttman further show^ that this approximation is appropriate when the 
correlations between the anti-images are close to zero* The anti-image 
correlation matrix is SR~^S* Note that SR*^S is the inverse of the 
matrix, S^^RS*!-, which was factored* This iflq>lies that the small roots 
of S*^RS*^ correspond to the large roots of SR^l-S* By omitting the 
small roots of S**RS*^, the large roots of SR*^ are omitted from the 
data analysis model; i*e*, the reproduced anti-image correlation matrix 
would have, as Guttman shows to be important, small off-diagonal entries* 
This approach was developed by Harris (1962) and he also provides further 
justification for it* 

In Section 6-3, four matrices are presented for each image analysis. 
First, there is the correlation matrix of the original variables* Second, 
the diagonal entries of S^ (i*e*, the anti-image variances or the 
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complements of the squsred-multiple-correlations) are given. Third, 
there are the Harris roots of G; B Finally, the rotated iMge factor 
matrix, F»T, is given. The anti-image variances indicate the degrees 
to which the original variables are not predictable from another— 
i.e., the proportions of the variables lAich approximate unique variance. 
The magnitudes of the omitted roots in Bg^ (roots corresponding to roots 
in Br^ less than one are omitted) indicate the amount of anti-image 
correlation. The hi^er these values, the less valid is the approx 
tion to pure factor analysis and the less likely it is that ^e un verse 
of variables has been properly sampled. The factor matrix gives the 
cross-covariances of the image variables and the rotated factors. The 
«»iin-ing of a factor is determined on the basis of the variables vh c 
have the highest covariances or loadings. The factors which are 
important are those which have the highest suais of squares of loadings. 
The variables which are essentially ignored in the analysis are those 
which have the lowest sums of squares of loadings. 



Regression Analysis 

Although planned, no extensive advanced regression analysis was 
used. To get a sense of the data, a set of simple multiple regressions 
was performed. These techniques are well-known and require no discussion. 



Contour-Plotting 

Somewhat tangentially to the regular course of this research, a 
statistical technique was developed and tested. The technique not 
new but rather is an application of a computer technique often used in 
geophysical research. A geophypiclst. might have gravity readings over 
an area and want to diagram the hypothetical gravity surface to aid his 
search for anomalies. To provide such diagrams, geo|^ysicist have 
developed computer programs which accept as input levels each point on 
a grid and which produce as output a contour map of the surface. A con 
tour map is a map in which lines are drawn connecting places of equal 
level or value. 



The statistical application of contour plotting concerns the 
examination of the relation of two variables as in diagramming their 
joint distribution. One solution commonly employed is tc produce a 
scatter plot, ftjwever, this is not generally appropriate when there 
are many observations. Even when there are few, scatter plots only 
suggest the term of a distributim. However, if the data are plotte 
on a grid and each variable is transformed by a lir^ar function into 
the range 1-n where n is some chosen integer, a contingency table My 
be produced, A contour plot may then be made, considering the grid 
points as equally spaced and the frequencies as levels. The 
then in the fom acceptable to a geophysical plotting program and the 
shape of the distribution can be explicitly diagraniiied. 
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This technique seems to have great promise In terms of finding and 
defining non«llnear distributions, A more important application may be 
in detecting mixture distributions— i.e,, distributions idiich are the 
sum of several different distributions of several different subpopulations. 
Tuo examples of frequency distribution contour plots are given in Section 
6-C. 






CmFIER 6 

RESULTS AMD INTERFREIATIOMS 



This cha!>ter contains four sections* In the first section* 
the principal components analysis lAich tias performed on the clusters 
of indicators is described* The second section describes the factor 
structures. The third section contains the regression structures. 
The fourth section is a concluding statement. 



A. Properties of the Clusters 



In this section are tabulated the results of the principal com- 
ponent analyses performed on the clusters of indicators. The discossimi 
is divided into three parts— one for Community Input* one for School 
Mediation* and one for Student Output. Principal component analysis 
seems to be inappropriate for clusters which contained non-homogeneously 
correlated indicators and the principal component fails to correlate 
with or correlates with the wrong sign of some of the indicators. The 
discussion is aimed* therefore* at redefining or clarifying the meaning 
of the cluster measurements in terms of their actual computational 
relationships with the indicators. For each group of clusters* the 
means* standard deviations* intercorrelations* and principal component 
loadings of the indicators sre given for each cluster within the group. 



Coamunitv Input 

The Cocanunity Input clusters were more appropriately analyzed by 
principal component method than the clusters of the other superclusters* 
since higher and fnore homogeneous inter- indicator correlations were 
found. The analytic results are discussed in the following paragraphs. 

Population Dynam-tcs . The analytic results for these clusters are 
presented in Table 6*^-1. The three growth indicators have a non- 
transitive correlation structure: birth rate and immigration are 

each correlated with residence newness* but they are not correlated 
with each other. This is reasonable since a high birth rate and a high 
jwimtin t- of immigration are each an independent source of residence 
construction. The principal component loads highly on all three 
indicators. The structure of the youth indicators is more complicated. 
Infants and children are correlated as are young men and young women; 
also* infants is correlated with young men and young women* but children 



ERLC 



TABLE 5-A-l 



CLUSTER ANALYSIS RESULTS FOR 
COflUNITY-INPUT— POPULATION DYNAl<aCS 



Clusters liean S* D* Correlations Principal 

Component 

Al* Growth 

(48*3% of the variance 
was extracted) 



Al*l 


Birth Rate 


21*50 


3*43 


••• *35 •*05 


*62 


Al*2 


Residence Newness 


11*27 


5*37 


.35 *32 




A1.3 


In«mlgration 


13*43 


4*12 


1 

1 

1 

CM 

CO 

• 

• 

s 





A2« Youth 

(53*2% of the variance was 
extracted) 



A2*l 


Infants 


12*29 


1.4S 


— 


.51 


.44 


.37 


.78 


A2*2 


Children 


26.15 


1.89 


.51 


— 


.13 


-.05 


CM 

• 


A2*3 


Young Hales 


4.58 


.66 


.44 


.13 


— 


t72 


.85 


A2*4 


Young Females 


4.93 


.67 


.37 


1 

8 


.72 


••• 


.78 



A3* Density 



A3.1 


Structure Density 


1.11 


.15 


— 


-.40 


•15 


-.43 


A3.2 


Unit Density 


3.26 


.23 


-.40 


— 


.55 


.94 


A3.3 


Room Density 


.49 


.03 


.15 


.55 


— 


.74 



as an indicator is not. This coiiplication is doubtless related to the 
presence of families with infants yielding high proportions of young 
adults and infants and to a general tendency for a comaunity to have 
many or few infants and children. All four indicators have high corre* 
lation with the principal component. The density cluster exhibits a 
non- transitive correlation structure; structure density and room density 
are each correlated with unit density but not v/ith each other. It 
should also be noted that the correlation between structure density an 
unit density is negative. Unit density (persons per unit) is high when 
structure density (units per structure) is low or when room density 
(persons per room) is high. These empirical results imply that density 
should be interpreted in terms of many people in a small living area. 

Population Characteristics . The analytic results for these clusters 
are presented in Table 6-A-2. The rural/urban cluster measurement was 
computed in such way that rural areas have high scores. The correlational 
structure of the three indicators is non-transitive. Urban residence 
is negatively correlated with rural non-farm residence and agriculture 
employments but the latter two are essentially uncorrelated. This 
result suggests that in a non-urban area, non-farm residence and 
employment in agriculture are essentially independent. The principal 
component correlates highly with each of the indicators, the correla- 
tion being negative for urban residence. The socioeconomic level 
cluster is somewhat confused because of the difficulty of coding 
occupation levels in rural areas. The farming occupation was given a 
relatively high occupation score, yet farmers generally have less income 
and education than high**status urban workers. In this cluster then, 
occ* j '.tion level correlates negatively with income level. However, the 
principal component correlates positively and highly with income level, 
and is probably an adequate measure of socioeconomic status. The 
indicator correlations for socioeconomic variation are essentially zero. 
The pattern of signs is the sane as for the previous cluster, and the 
principal component may be considered to represent variation in socio- 
economic standing. 

Soc i alization Characteristics. The analytic results for these 
clusters arc presented in Table 6-A-3. For the social activity involve- 
ment cluster, the social workers indicator is luicorrelated with the 
other indicators and has a low correlation with the principal component. 
On the other hand, public employees and female employees are correlated, 
and both correlate highly with the principal component. However, the 
meaning of this cluster is obscure. The two indicators of the social 
mobility clusters are essentially uncorrelated, but what correlation 
exists is negative. This makes the indicators correlate with the 
principal component in opposite directions; l.e. an area with a high 
score on the principal component would probably have a low heterogeneity 
but a high population mobility. This must be related to the urbaniza- 
tion processes in Iowa. There are three poverty culture indicators, 
and the last, unemployment, is essentially uncorrelated with the other 
two and has a low correlation with the principal. component. Apparently, 
unemployment is not a regular problem in Iowa. The other two indicators, 
low income, and poor housing are correlated and both have high correla- 
tions with the principal component. 
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TABLE 6-A-2 



CLUSTER ANALYSIS RESULTS FOR 
CXMIUNITY- INPUT— POPULATION CHARACTERISTICS 



Clusters 



Kean S. D. Correlations 



PrlTicipal 

Component 



21. Rural /Urban 

(69.9% of the variance 
was extracted) 



Bl.I Urban Residence 


13.70 


26.32 




o 

00 

• 

1 


-.67 


-.99 


B1.2 Rural Non-Fam 


40.89 


19.13 


0 

CO 

• 

1 


— 


.12 


.79 


Residence 


Bl*3 Agriculture Employment 


40.69 


14.60 


-.67 


.12 


mm mm mm 


.69 


Socioeconomic Level 
(58.87L of the variance 


was extracted) 
B2.1 Income Level 


4.99 


.55 


••• 


.39 


-.60 


.91 


B2.2 Education Level 


5.C2 


.20 


.39 


mm^m 


1 

• 

o 


.56 


B2.3 Occupation Level 


7.40 


.48 


-.60 


0 

• 

1 




1 

• 

VD 


Socioeconomic Variation 
(38.9% of the variance was 
extracted) 


23.1 Income Variation 


8.12 


1.01 


mmtm 


.06 


1 

• 

o 


.63 


B3.2 Education Variation 


2.02 


.21 


•06 




1 

• 

o 

00 


.57 


B3 • 3 Occupation Var ia t ion 


9.20 


1.11 


0 

• 

1 


00 

0 

• 

1 


— 


-.67 



in r-— 



TABLE 6-A-3 

CLUSTER ANALYSIS RESULTS FOR 
COMbfUNITY INPUT — SOCIALIZATION CHARACTERISTICS 



y 



er|c 



Clusters 


Mean 


S. D. 


Correlations 


Principal 

Component 


Cl. Social Activity/Involvement 
(45. 5Z of the variance 
was extracted) 














Cl . 1 Social Workers 


2.25 


1.69 


— 


.13 


.00 


.30 


Cl. 2 Public Employees 


2.56 


1.40 


.13 




.34 


.83 


Cl. 3 Female Employees 


35.74 


7.84 


.00 


.34 




.77 


C2. Social Mobility 

(37.2% of the variance 
was extracted) 














C2 . 1 Population 

Hetvrogeneity 


15.73 


6.98 


— 


-.14 




-.76 


C2.2 Population Mobility 


22.93 


4.69 


-.14 






.76 


C3. Poverty Culture 

(45.2% of the variance 
was extracted) 














03 . 1 Low Income 


22.10 


6.25 


— 


.32 


-.03 


.72 


C3.2 Poor Housing 


41.62 


11.13 


.32 


— 


.19 


.84 


C3 . 3 Unemp loyment 


2.68 


1.73 


-.03 


.19 




.37 
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Economic Structure. The analytic results for these clusters are 
presented in Table 6-A-4. In the wealth cluster, property value and 
rent value have a substantial inter correlation and are both hi^ly 
correlated with the principal component. Cut automobile availability has 
very low correlations with property value and rent value and only a 
moderate correlation with the principal component. Apparently, the 
availability of automobiles is confounded by other factors— perhaps the 
age and type of automobile. Both the industry and trade clusters con- 
sist of highly and homogeneously correlating indicators, and the 
correlations with the principal components are high. The distinction 
made between industry and trade is not clear. The farming cluster 
consists of variables with low intercorrelations. However, the principal 
component seems logical. It correlates negatively with family farm help 
and positively with paid farm help and food industry. It would seem, 
then, that an area with a high value on the component would have a 
cononercial frrming industry rather than family farming. 

Educational Resources. The analytic results for these clusters are 
presented in Table 6-A-5. The first cluster, archive usage, contains 
just one indicator, librarians, and the principal component is equal to 
the indicator. However, the indicator librarians is probably without 
value, since the census data on which it is based are too gross. The 
school areas are relatively small, the count of employed librarians is 
based on a l-in-4 sample, and librarians are rare. Therefore, the 
estiraated number of librarians per 1,000 population tends to be inaccurate. 
The two educational involvement indicators, school enrollment and school 
employment are essentially uncorrelated. This is probably due to the 
fact that rural teachers do not always live in the area where they 
teach. Both indicators correlate highly with the principal component. 

The two indicators for the educational attainment cluster, college 
attainment and professional employment, have a high intercorrelation 
and both correlate highly with the principal component. 



School Mediation 



It must be admitted that the measurement achieved of the school 
mediation clusters was generally Imperfect. Part of the reason was the 
absence, noted earlier, of data appropriate to measure the kinds of 
social processes theoretically considered important. Part of the reason 
was the lack, also noted earlier, of an analytic technique for composit- 
ing non-homogeneously correlating variables. Another basic problem with 
the data bank information became apparent upon examination of the cluster 
analysis results and the associated source data. All but the most gen- 
eralized curricular control is exercised by local school .officials. 
Consequently, course contents, requirements, and standards vary widely. 

On the other hand, in fitting the hundreds of curriculum patterns into 
a statewide code, many quite inappropriate equivalences had to be assumed 
by the founders of the data bank. For example, English IV may be a 
course required of everyone in one district and a course taken only by 
the best five percent of students in another. The general problem is that 
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TABLE 6-A-4 



CLUSTER ANALYSIS RESULTS FOR 
COMMUNITY INPUT-ECONOMIC STRUCTURE 



Clusters 


Mean 


S. D. 


Correlations 




Principal 

Component 


Dl. Wealth 

(50.8% of the variance 
was extracted) 

Dl . 1 Property Value 


00 

• 


.73 




.45 


.12 


.81 


D1.2 Rent Value 


6.02 


.78 


.45 


— 


.15 


.83 


D1.3 Automobile 

Availability 


1.12 


.10 


.12 


.15 


— 


.43 


D2. Industry 

(65.2% of the variance 
was extracted) 

D2 . 1 Industry Emplo^mient 


22.82 


7.95 




.34 


.63 


.83 


D2 . 2 Managerial Employment 


6.34 


2.43 


.34 


— 


.44 


.70 


D2.3 Clerical Employment 


7.53 


2.95 


.63 


.44 


— 


.88 


D3. Trade 

(75.3% of the variance 
was extracted) 

D3.1 Retail Employment 


14.45 


4.67 




.58 


.73 


.89 


D3.2 Service Employment 


5.52 


2.08 


.58 


— 


.58 


.82 


D3.3 Sales Employment 


5.36 


2.35 


.73 


.58 


— 


.89 


D4. Farming 

(40.7% of the variance 
was extracted) 

D4.1 Family Farm Help 


.11 


.07 




-.19 - 


-.02 


-.69 


D4.2 Paid Farm Help 


.18 


.08 


-.19 




.10 


.76 r 


D4.3 Food Industry 


3.44 


2.69 


-.02 


.10 




.40 
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TABLE 6-A-5 



I 



I 



CLUSTER ANALYSIS RESULTS FOR 
COMMUNITY INPUT-EDUCATIONAL RESOURCES 



Clusters 


Mean 


S. D. 


Correlations 


Principal 

Component 


El. Archive Usage 

(100% of the variance 
was extracted) 

El . 1 Librarians 


.32 


.39 






1.00 


E2. Educational Involvement 
(53.5% of the variance 
was extracted) 

E2.1 School Involvement 


25.73 


2.83 




.07 


.73 


E2.2 School Employment 


5.48 


1.91 


.07 


— 


.73 


E3. Educ«.^lonal Attainment 
(74.5% of the variance 
was extracted) 

E3.1 College Attainment 


13.15 


3.56 






.86 


E3.2 Professional 
Employment 


7.01 


2.30 


.49 


— 


.86 
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altk the data bank codings have administrative meanlng^-e.g* , in 
both cases above, "Bngllsh IV" Is the proper printout for class lists— 
they do not always have substantive meaning, 

StliMilatloti Siibat^ge , The analytic results for these clusters are 
given In Tid>le 6<*A-6. The problem of general coding Inccmslstency dis- 
cussed abovo Is especially Important for these clusters. It was originally 
Intended that these clusters represent different substances and not be 
tied to administrative distinctions such as courses. For example, 
literature discussed In a history course was English stimulation nonethe- 
less, Since this was not possible, the administrative distinctions had 
to be used. Even then, the curricular differences between scb'ools has 
essentially prevented any measurement of stimulation substance. The 
Indicators within each of the five clusters have very low correlations. 

It does not seem profitable to attempt explanation of the patterns of 
correlations or to Interpret the meaning of the principal components. 
Although these concepts are doubtless essential to the question of hl^ 
school size and non-dollar costs— e.g., large schools may very well provide 
more stimulation In certain areas— they could not be measured. 

Stimulation Structure. The analytic results for these clusters are 
presented In Table 6Wk-7. The correlations for the mean and variation 
teacher experience clusters are fairly high and all the indicators 
correlate highly with their respective principal components. The indica- 
tors selected for classroom hererogeneity seem to be essentially uncorre- 
lated both for mean-level and variation. As noted, principal compements 
analysis yields spurious results In such cases; only 33 percent and 27 
percent of the variances of the indicators Is extracted. 

Satisfaction. The analytic results for these clusters are presented 
In Table 6-A-8. For the first cluster, mean attitude toward study, an 
Indicator was omitted and the resulting two indicators do not define 
a meaningful concept. The second cluster, var laticm in attitude toward 
study, has low interlndlcator correlations, and principal components 
analysis has yielded a component with unlnterpretable signs. The mean 
social Interaction cluster has one moderate interlndlcator correlation 
between teacher attitude and activities. There Is apparently a more 
comfortable social relationship between teachers and students when they 
Interact in activities. Both Indicators correlate hi^ly with the prin- 
cipal conq>onent. The Indicators for variation of social interaction 
have positive correlations and correlate highly with the principal 
compement. The composite measure apparently indexed variation In social 
attitude and participation— l.e., the presence of both activists and 
loners. 



Student Output 

For student output, the more concrete clusters, which involved an 
established teat battery and a definite end obvious aspiration scale, 
seemed quite amenable to the principal component analysis. The abstract 
clusters, which represented attempts Indirectly to measure emotional 
qualities, yielded equivocal results. 



TABLE 6-A-6 



CUJStER ANALYSIS RESULTS FOR 
SCHOOL MEDIATION-STIMULATION SUBSTANCE 



Cluster 



Mean S. D. Correlatloas 



Principal 

CoBponent 



FI. Matb'">^tlc8 

(32. i- oT the variance 
was extracted) 

Fl.l Advanced Mathenatlcs 

FI. 2 Math. Consistency 

FI . 3 Matheaat Ics/Llke 

FI. 4 Mathenatics/Dlslike 



21.07 


10.84 


— 


-.03 


Os 

o 

• 

1 

Os 

o 

• 


.45 


.77 


.10 


-.03 




.04 .05 


-.06 


14.73 


4.86 


.09 


.04 


.26 


.74 


24.64 


6.79 


-.09 


.05 


-.26 1.00 


-.75 



4.20 


8.19 


— 


-.08 


1 

• 

o 


.03 


.23 


.81 


.09 


00 

o 

• 

1 


— 


.00 


.08 


.U 


12.38 


4.95 


-.09 


.00 


— 


-.36 


-.81 


13.11 


5.40 


.03 


.08 


-.36 


— 


.81 



29.85 


16.47 


— 


-.11 


.12 


-. 5.3 


.44 


.81 


.09 


-.11 


— 


-.20 


.08 


-.49 


11.77 


4.46 


.12 


-.20 


— 


-.37 


.76 


18.21 


6.61 


-.13 


.08 


-.37 


— 


-.71 



F2. Science 

(34. 6Z of the variance 
was extracted) 

F2.1 Advanced Science 

F2.2 Science Consistency 

F2.3 Sclence/Llke 

F2.4 Science/Dislike 

F3. English 

(38. 4Z of the variance 
was extracted) 

F3.1 Advanced English 

F3.2 English Consistency 

F3.3 English/Llke 

F3.4 Englj.sh/Dlsllke 

F4. History 

(32.6% of the variance 
was extracted) 

F4.1 Advanced History 32.44 

F4.2 History Consistency .81 

F4.3 History /Like 10.23 

F4.4 History /Dislike 18.73 

F5. Foreign Language 



(37. 5Z of the variance 




was extracted) 




F5.1 


Foreign Language 


13.55 


F5.2 


For. Lang. Consistency 


.71 


F5.3 


For. Lang. /Like 


2.09 


F5.4 


For. Lang. /Dislike 


9.48 



13.08 


— 


.07 


-.05 


.13 


.50 


.07 


.07 


— 


.04 


.15 


.45 


4.31 


-.05 


.04 


— 


-.20 


-.51 


6.73 


.13 


.15 


-.20 


— 


.76 



7.69 


— 


.33 


.32 


.04 


.82 


.30 


.33 


— 


.05 


.15 


.67 


1.95 


.32 


.05 


— 


-.03 


.57 


4.85 


.04 


.15 


-.03 


— 


.23 



TABLE 6-A-7 



CLUSTER AHALTSIS RESULTS FOR 
SCHOOL MEDIATION— STIMULATION STRUCTURE 



Cluster 




Mean 


S. D. 


Cevaelations 




Principal 

Component 


Gl(rl). lieaii Teacher Experience 

(59. 9Z of the variance was 
extracted) 
















61Qf).l 


Teacher Age 


37.74 


3.86 


— 


.47 


.31 




.77 


61(M).2 


Teacher Experience 


4.25 


1.76 


.47 


— 


.41 




.83 


61(M).2 


Teacher College 
Credit 


152.57 


7.72 


.31 


.41 


— - 




.72 


G1(V). Variation of Teacher Exper- 
(49. 5Z of the variance was 
extracted) 
















Gl(V).l 


Teacher Age Var. 


165.04 


90.02 


— 


.35 


.28 




.82 


G1(V).2 


Teacher E]q>erlence 
Variation 


30.89 


26.41 


.35 


— — 


.08 




.68 


G1(V).3 


Teacher College 
Credit Var. 


6U.37 


589.20 


.28 


.03 


— 




.59 


G2Q1). Mean Classrooa Heterogeneity 
(33. 2Z of the variance was 
extracted) 
















G2Qf).l 


Classroom Size 


19.03 


2.83 


— 


.07 


-.04 


.16 


.45 


G2(M).2 


Classroom MPA 
Heterogeneity 


.39 


.14 


.07 


— 


.02 


.17 


.54 


G2(M).3 


Classroom Homework 
Heterogeneity 


2.12 


.40 


-.04 


.02 


— 


.20 


.46 


G2(a).4 


Classroom Expect. 
Heterogeneity 


4.87 


.85 


.16 


.17 


.20 


— 


.79 


G2(V). Variation of Classrooni Heter 
(26. 8Z of the variance was 
extracted) 


• 














G2(V).l 


Classroom Size Var. 


62.85 


25.85 




-.01 


.04 


.02 


.55 


G2(V).2 


Classroom MPA Heter 
Variation 


• 

.06 


.08 


-.01 




-.05 


.03 


-.47 


G2(V).3 


Classroom Homework 
u£tsr. Var. 


5.75 


1.88 


.04 


-.05 




.02 


.71 


G2(V).4 


Classroom Expect. 
Heter. Var. 


7.50 


2.98 


.02 


.03 


.02 


— 


.21 
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TABLE 6-A-8 



CLUSTER AHALYSIS RESULTS FOR 
SCHOOL MEDIATION— SATISFACTION 



Cluster 


Mean 


S. D. 


Correlations 


Principal 

CoDqK>nent 


H1(M). Mean Attitude Toward Study 
(51. 3Z of the variance 
was extracted) 

Hl(M).l Student Attitude* 














H1(M).2 HoBMWorfc 


2.96 


.36 


— 


.03 




.72 


HI (M) . 3 Parent Attitude 


2.00 


.14 


.03 


— 




.72 


Hi(V). Variation of Attitude Toward 
Study (38. 6Z of the variance 
was extracted) 

H1(V) . 1 Stud « Attitude. Var. 


.64 


.14 




-.09 


.05 


.56 


H1(V).2 Honework Variation 


2.57 


.46 


-.09 


— 


-.10 


-.69 


H1(V).3 Parent Att. Var. 


1.03 


.12 


.05 


-.10 


— 


.61 


H2(M). Mean Social Interaction 
(41«7Z of the variance 
was extracted) 

H2(H)*1 Unpleasant Exper. 


2.16 


.14 




-.07 


.03 


-.12 


H20l).2 Teacher Attitude 


3.29 


.13 


-.07 


— 


.25 


.80 


H2(M).3 Activities 


2.78 


.50 


.03 


.25 


— 


.77 


H2(V). Variation of Social Inter* 
action (47. 6Z of the 
variance was extracted) 

H2(V).l Unpleasant Exper- 
ience Var. 


.68 


.16 




.30 


.17 


.75 


H2(V).2 Teacher Attitude 
Var. 


.68 


.19 


.30 




.16 


.74 


H2(V).3 Activities Var. 


2.83 


.66 


.17 


.16 


— 


.57 


*0nltted 
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Ability. Ihe analytic results for ttese clusters 
Table 6-4-9. All clusters have high and honogeneous ^ 

relations; the correlations are about .70 for mean-level ^ 

about .AO for variation clusters, and the principal 
are about .90 for mean-level clusters and about .80 for 
Althou^ thU seems hl^. It Is necessary to raise a “ ™ ® 

analyses. At uhat stage should the test scores ^e *“ 

analyzed. The approach used of clustering school «eaM 
was dictated by the need to maintain analytic 

analyses. Honfever, it is possible that some of the factoria P 

present within the ITED battery could '*®**®' 

a differently staged analysis been selected, ftesumably the ^ 
present In the battery are Indlvlduallstlcs; they 
detected by dealing with the student by test 

derived could have been averaged over schools to obtain ®<*^ *®“ * 

Ihe analysis as performed probably blurred the actual ®™P*-®“*^ 
battery. This will be observed later in the image analysis or tn 

student output cluster. 

Aspiration. The analytic results for these 
in Table 6-A-lO. The mean aspiration cluster yields a compMent imich 
is correlated only with educationcl aspiration end discrepancy, ^ither 
of those two indicators correlates with girls aspirations. e ® . 

principal component correlates with the essential 
in aspiration indicators have lower but more homogeneous 
The principal component correlates highly with all three indicate » 
thus must be a reasonable representation of this variation. 

Br«M»nsl Outlooks. The analytic results for this cluster are 
presented in Table 6-A-ll. Both clusters hwe extreroly . 

indicator correlations, and, consequently, the principal “"J® . . ^ 

failed to provide reasonable composites. These clusters must therefore 

be omitted from consideration. 



B. Factor Structures 



The principal component analysis of the clusters was described in 
the previous section. From each cluster of indicators, a princ pa 
component scale was defined. Computationally, the next step ^ 
compute for each school a principal component score for e ad. clu ster. 
This may be considered as a reduction of varUbles— as ® 
o^ratiL. The resulting scores were used as input for image 
Separate image analyses were performed for each “'*® 

clusters: Community Ihput, School Mediation, and Student Output. 

In this section, the results of those analyses are reported. F<« 
each analysts, the correlation matrix of the clusters is presente d. A 
table is then presented which contains the major outputs of “ * . 

analysis. The anti-image variance of a cluster is the proportions 
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TABLE 6-A-9 



CLUSTER AHALYSIS RESULTS FOR 
STUDENT OUTPUT— ABILITY 



Cluster Mean 


S. D. 


Correlations 


Principal 

Component 


11 (M). Mean Verbal Ability 

(80. 8Z of the variance 
was extracted) 












Il(M).l Expression 19.05 


1.23 


— 


.72 


.67 


.89 


I1(M).2 Vocabulary 19.24 


1.29 


.72 


— 


.74 


.91 


11(H). 3 Literature 18.94 


1.47 


.67 


.74 


— 


.90 


11 (V). Variation of Verbal Ability 
(60. 3Z of the variance 
was extracted) 












Il(V).l Expression Var. 24.51 


7.49 


— 


.39 


.37 


.75 


I1(V).2 Vocabulary Var. 26.73 


8.08 


.39 


— 


.45 


.80 


11 (V). 3 Literature Var. 33.21 


9.38 


.37 


.45 


— 


.79 


12 (M). Mean Science and Quantlfi^ 
cation Ability (78. 5Z of 
the variance was extracted) 












I2(M).l Science Background 20.36 


1.42 


— 


.66 


.70 


.89 


I2(M).2 Quantification Bkgd.19.39 


1.70 


.66 


— 


.68 


.88 


I2(M).3 Science Interp. 20.00 


1.73 


.70 


.68 


— 


.90 


12 (V). Variation of Science and Quan- 
tification Ability (64. OZ of 
variance was extracted) 












I2(V).l Science Bkgd. Var. 26.46 


9.05 


— 


.46 


.49 


.82 


12 (V). 2 Quant. Bkgd. Var. 41.64 


10.83 


.46 


— 


.42 


.78 


I2(V).3 Science Interp.Var. 44.53 


12.32 


.49 


.42 


— 


.80 


13 (M). Mean Social Science Ability; 
(89. 2Z of variance was ext.. 












I3(M).1 Social Concepts 19.81 


1.51 


— 


.76 


.87 


.93 


I3(M).2 Social Science Int. 19.38 


1.73 


.76 


— 


.88 


.93 


13(H). 3 Inform. Source Util. 21. 03 


1.59 


.87 


.88 


— 


.97 


13 (V). Variation of Social Science 
Abillty(76.0Z of the vari- 
ance was extracted) 












I3(V).1 Social Concepts Var. 32.72 


9.70 


— 


.50 


.77 


.87 


13 (V). 2 Soc. Sci. Interp. Var. 40. 88 


10.08 


.50 


— 


.65 


.81 


I3(V).3 Inf. Source Util. Var. 39. 21 


10.69 


.77 


.65 


— 


.93 



TABLE 6-A-lO 



CLUSTER ANALYSIS RESULTS FOR 
STUDENT OUTPUT— ASPIRATION 



Cluster Mean S. D. Correlations Principal 

Component 



J1(H). Mean Aspiration 

(53.7% of the variance 
was extracted) 



Jl(M).l Educational 

Aspiration 


6.44 


.67 


— 


.60 


.06 


.89 


J1 (M) . 2 Educational 

Discrepancy 


3.83 


.72 


.60 


— 


.06 


.89 


J1(M).3 Girls* Aspirations 


3.40 


.30 


.06 


.06 


— 


.17 


Variation in Aspiration 
(54.2% of the variance 
was extracted) 














Jl(V).l Educational Aspir- 
ation Variation 


6.97 


1.50 


— — ^ 


.48 


.23 


.82 


J1*(V).2 Educational Die- 

crepancy Varia- 
tion 


9.00 


.254 


.48 




.20 


.80 


J1(V).3 Girls' Aspirations 
Variation 


.81 


.44 


.23 


.20 




.56 






•‘fit. t lffi ^ ' $ (;' ^nf | m ^^p^ 



TABLE 6-A-ll 

CLUSTER ANALYSIS RESULTS FOR 
STUDENT OUTPUT-EMOTIONAL OUTLOOKS 



Cluster 


Mean 


S. D. 


Correlations 




Principal 

Component 


K1(M). Mean Adjustment 

(28.1Z of the variance 
was extracted) 
















Kl(M).l Markpoint Variance 


2.34 


1.55 


— 


.02 


.07 


-.05 


.51 


K1 (M) . 2 Aspirational 

Discrepancy 


.81 


.41 


.02 


— 


.02 


-.10 


.61 


K1 (M^ . 3 Uncertainty 


9.24 


6.02 


.07 


.02 


— 


.02 


.30 


R1(M).4 Capacity Learning 


.73 


.10 


-.05 


-.10 


.02 


— 


-.63 


K1(V). Variation of Adjustment 
(60. OZ of the variance 
was extracted) 
















Kl(V).l Markpoint Variance 
Variation 


18.42 


28.55 


— 


-.04 






-.72 


K1(V).2 Aspirational Dis- 
crepancy Varia- 
tion 


3.72 


1.76 


-.04 


— 






.72 
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cluster score's variation which is not predictable from the other scores* 
This proportion is related to the relative uniqueness of the score* The 
Harris roots of G (the image covariance matrix) are also given* The 
final roots idien large, indicate that a poor sample of the content domain 
has been achieved* The rotated image factor matrix is given* For each 
factor, a factor sum of squares is given»-this is simply the sum of squares 
of the loadings on the factor, and it indexes the relative importance of 
the factor* In the accompanying texts, some interpretations and evalua- 
tions of the analyses are given and an attempt is made to title the 
factors* For notation, the codes CI-1, CI-2, ***, CI-8 will be used to 
identify the community input rotated factors (the order is as given in 
Table 6-B-2)* The codes SM-1, SH-2, ***, SH-7 will be used to identify 
school mediation rotated factors (the order is as given in Table 6-B-4)* 
The codes SO-1, SO-2, , SO-5 will be used to identify the student 

output rotated factors (the order is as given in Table 6-B-6)* 



Comounitv Input 

I 

The correlation matrix for the 13 coumunity input cluster scores 
is given as Table 6-B-l, and the image analysis results are given as 
Table 6-B-2* Host of the scores have relatively low anti-image var- 
iances; i*e*, most of the scores share a common variation except for 
B3 (socioeconomic variation), D4 (farming), El (archive usage), and « 

E2 (educational involvement)* As i;kOted in the previous section, El sihd 
E2 were unusual in their principal components analysis; El consisted of 
just one probably inaccurate indicator and E2 consisted of two essentially 
uncorrelated indicators* The last Harris roots are relatively small 
compared to the first roots* This indicates that the anti-image corre- 
lations are small and that a reasonable approximation to a universe of 
content has been achieved* The rotated image factor matrix contains four 
important factors, CI-1 to CI-4. The factor structure is simple* Ten of 
the variables have one large loading on the first four factors; two var- 
iables have two large loadings; two factors have one moderate loading, 
and one variable (El) has no substantial loadings* 

The first factor, CI-1, has high loadings on 32 (socioeconomic level). 
Cl (social act ivity/ involvement ) , C2 (social mobility), D2 (industry), 

D3 (trade), and negatively on B1 (urban)* This factor can be identified 
as urbanization* Since the data are from Iowa, the characteristics of 
urbanization are related to the particular urbanization patterns of Iowa* 
For example, there are few big cities, and consequently fewer overwhelming 
urban problems such as poverty. The people in areas with high scores 
on this factor tend to have high socioeconomic status, hl^ social 
mobility and activity levels, and to be involved in industry or trade* 

The second factor, CI-2, may be called vitality* The loadings are 
high with A1 (growth), A2 (youth), A3 (density) and Dl (wealth)* The 
areas with hi^ scores on this factor are achieving population and 
economic groi-nrh* 

The third factor, CI-3, is difficult to interpret* It correlates 
with B3 (socioeconomic variation), D3 (trade), E3 (educational attain- 
ment), and negatively with B1 (rural /urban)* Also there are five smaller 
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ERIC 



*The entries are multiplied by 100 and rounded 



loadings greater than .25. A reasonable hypothesis would seem to be that 
areas with high scores on this factor are small urban towns located as 
trade centers in rural areas. The high variaticm in socioeconomic 
standing would then be due to the combination of farmers and tradesmen. 
The high degree of educational attaixmient would be due to town-based 
professional persons serving the siurrounding area. The smaller load- 
ings, such as with Cl (social activity/involvement), D1 (wealth), and 
El (archive usage) seem to verify this hypothesis. A reasonable title 
for the factor might then be trade centers. 

The fc-urth factor, CI-U, may be called wealth where that is 
intended in the sense of level or standard of living. The high load- 
ings are with B2 (socioeconomic level), D4 (fanning), and negatively with 
C3 (poverty culture). 



School Mediation 



The correlation matrix for the school mediation cluster scores is 
given in Table 6-B-3, add the results of the image analysis are given 
in Table 6-B-4. As noted in the previous section, the measurement of 
these clusters was isqperfect. In the correlation matrix there are <mly 
two substantial entries: the correlation between B1 (M) (mean teacher 
esqperience) and B1 (v) (variaticm of teacher experience) is .U8 and the 
correlation between G2 (v) (variation of classrocm heterogeneity) and 
ffl. (V) (variation of attitude toward study) is -.45. The anti-image 
variances are all hi^, which indicates that most of the variance of 
each variable is unique, i.e., unrelated to the other variables. The 
Hurris roots are very low, indicating that there was not much coomonality. 
None of the rotated factors have large variances: i.e., none are 

important. Ifowever, in order to satisfy the needs of analysis in the 
next section, the first five factors were retained. 

The first factor, SM-1, has loadings on G1 (M) (mean teacher 
experience) and 61 (V) (variation of teacher experience). It will be 
called teacher experience. The sec<md factor, SM-2 has a positive load- 
ing on HI (V) (variation of attitude toward study) and negative loadings 
on 62 (v) (variation of classroom heterogeneity) and HI (m) (mean atti- 
tude toward study). It will be called conflict, which is suggested Iqr 
the attitude clusters. The third factor, SM-3> has positive loadings 
with HI (M) (mean attitude toward study) and B2 (M) (mean social inter- 
action) and negative loadings with G2 (m) (mean classroom heterogeneity) 
and H2 (m) (variation of social intereiction). It will be called comfort, 
which is intended to imply a pleasant social enviroiment. The fourth 
and fifth factors are related to the stimulation substance clusters, 
which were totally obscure, and will not be titled* 



Student Output 

The correlation matrix for the student output cluster scores is 
presented as Table 6-B-5 and the results of the image analysis are 
presented as Table 6-B-6. The inter correlations for the clusters that 
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TABLE 6*B-5 

INTERCOBRELATIONS* OF STUDENT OUTPUT CLUSTER SCORES 




IMAGE ANALYSIS RESULTS*FOR STUDENT OUTPUT 




were derived from the ITED test battery are generally high and the anti- 
image variances are correspondingly low. The other clusters have low 
correlations and hig^ anti-image variances. The last Harris roots are 
relatively high, which indicates that part of the ccntent domain was not 
sampled well. Presumably, this part has to do with emotional outlooks 
of departing seniors. Three rotated factors were retained. The third 
has a small variance but represents what can be sho^’n of the aspiration 
clusters . 

The first factor, SO-1, loads highly on the three mean-level test 
battery cluster's, II (m), 12 (m), and 13 (m) cmd also has a small loading 
with J1 (m) (mean aspiration). This factor will be c allied achievement, 
intended in its academic sense. It should be noted again that because 
of the clustering procedure, any factorial structure within the test 
battery has been smoothed out. However, this factor is doubtless a 
stable measure of ability or achievement level. 

The second factor, SO-2, loads highly on the three variation test 
battery clusters, II (V), 12 (v) and 13 (V). Again, any factorial 
complexity of the test battery segments tos been smoothed out. Schools 
with high values on this factor have a high variance of scores on the 
tests: probably many students do rather poorly and many do quite well. 
This factor will be called academic heterogeneity. 

The third factor, S0-3» has, as noted, a very low variance. Tet 
it is retained because it has loadings with the aspiration clusters. 

In particular, it loads positively cm J1 (m) (mean aspiration) and 
negatively on J1 (v). (There is alsa a loading on Id (m), but that 
cluster is uninterpi^^^ble.) This factor will be call^ aspiration. 

Note that schools with high scores on this factor tend to have hig^ 
aspiration levels and low variance in aspiration. 



C. Regression Structures 



Analyses of the relaticmships among the image factors described in 
the last secticm and of the relationships between those factors and 
high school size are presented below. The analyses do not correspond 
to the full complexity of the theoretical substantive framework described 
in Section 5 -A. That framework consisted of a series of interrelated 
and directed relationships. Community Input clusters were supposed to 
represent processes which directly influence the characteristics of 
students entering high school and which exert a continuing influence 
over students while they are in school. The Student Output factors were 
supposed to represent the characteristics— academic, social, and 
emoti(mal— of the graduating class. The School Mediaticm factors were 
supposed to represent the processes occurring in a school which deter- 
mine the transforming properties of the school and which, along with 
community processes, produce graduates. Within this context, the 
single School Manipulation variable— namely, size— was considered to 
influence the School Mediation processes and in turn to relate to the 
other clusters. 
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The School Mediation cluster measurements are critical to such 
analysis. These measxirements, it was noted, %iere imprecise. Conse- 
quently, a simple set of analyses uas designed and made. In particular, 
regression analyses were performed in order to indicate what sorts of 
Student Output characteristics are independently related to the Community 
Input and School Mediation factors and ' ) size. In addition, contour 
maps were prepared to give a better idea of the precise arrangement of 
some of the relationships. The School Mediation factors were included 
in both these analyses in spite of the fact that their meanings are still 
unclear. 

The shortcomings in the data and in the earlier analysis meant that 
this final stage in data analysis was not developed as it might have 
been. In particular, no substantive conclusions will be offered. 



Correlation Analysis 

The inter -correlations of the factors and size are presented in 
Table 6-C-l. Within each supercluster — Community Input, School Mediation, 
and Student Output— the factors are completely uncorrelated. The correla- 
tion matrix is blocked according to size and the three sets of image 
factors. The largest correlation is between size and urbanization: big 
schools are usually found in urban places. All the other correlations 
are quite low — only two are above .30. It should be noted that size 
correlates moderately with almost all the variables. 



Regression Analysis 

Within each supercluster the factor scores are automatically un- 
correlated. In multiple regressicm with uncorrelated predictor variables, 
the amount of variance accounted for by the set of predictors is exactly 
the sum of the amounts of variance accounted for by the individiial pre- 
dictors. (This is not true for correlated predictors.) Thus, in the 
present regression analysis, both groups of independent factors could 
be manipulated as though they were single variables. 

Each of the three Student Output factors was considered separately. 
For each factor, seven regressions were run, corresponding to the seven 
combinations of the three groups of predictor factors, including size. 

The results are presented in Tables 6-C-2 to 6-C-4. For each regression 
equation, there is a column giving the standeird regression coefficients. 
The degree of determination achieved by each equation is simonarized in 
Table 6-C-5. For each equation, the coefficient of determination (squared 
multiple -correlation) is presented. All these coefficients are small; no 
more than 13t of the variance in Student Output is ever achieved. It 
should be kept in mind, therefore, that in the following discus si<xis of 
differences in the effect of groups of independent variables;, only a 
small part of the total variation of Student Output is in question. 
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Note: Entries which are automatically zero are printed without a decimal point. 









TABLE 6-C-2 



REGRESSION WEIGHTS USING 
DEPENDENT VARIABLE SO-1 
(ACHIEVEI'IENT) 







Size Cl 


SM 


Size 

+ 

Cl 


Size 

+ 

SM 


Cl 

+ 

SM 


Size 
+ Cl 
+ SM 




Size 


.19 




.15 


.19 




.16 


(Urbanization) 


CI-1 


.03 




-.06 




-.05 


-.11 


(Vitality) 


CI-2 


-.03 




-.05 




-.01 


-.04 


(Trade Center) 


CI-3 


.28 




.23 




.25 


.20 


(Wealth) 


CI-4 


.16 




.15 




.14 


.12 


(Teacher Experience) 


SM-1 




.11 




.06 


.10 


.08 


(Conflict) 


SM-2 




.17 




.14 


.09 


.09 


(Comforc) 


SM-3 




-.07 




.02 


-.06 


-.01 




SM-4 




.02 




.07 


.01 


.04 




SM-5 




.02 




-.02 


.02 


-.01 
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TABLE 6-C-3 



REGRESSION WEIGHTS USING 
DEPENDENT VARIABLE SO-2 
(ACADEMIC HETEROGENEITY) 







Size 


Cl 


SM 


Size 

+ 

Cl 


Size 

+ 

SM 


Cl 

+ 

SM 


Size 
+ Cl 
+ SM 




Size 


.21 






.32 


.19 




.30 


(Urbanization) 


CI-1 




.05 




-.14 




-.11 


-.22 


(Vitality) 


CI-2 




.08 




.03 




.10 


.06 


(Trade Center) 


Cl >3 




.00 




-.10 




-.06 


-.14 


(Wealth) 


CI-4 




.04 




.02 




.01 


-.01 


(Teacher Experience) 


SM-1 






.10 




.05 


.15 


.11 


(Conflict) 


Sm-2 






.08 




.04 


.12 


.12 


(Comfort) 


SM-3 






-.13 




-.04 


-.18 


-.09 


— 


SM-4 






.00 




.05 


-.02 


.04 




SM-5 






.23 




-.03 


.03 


-.02 
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TABLE 6-C-4 






REGRESSION WEIGHTS USING 
DEPENDENT VARIABLE SO-3 
(ASPIRATION) 











Size 


Size 


Cl 


Size 










+ 


+ 


+ 


+ Cl 






Size Cl 


SM 


Cl 


SM 


SM 


+ SM 




Size 


-.21 




-.22 


-.21 




-.21 


(Urbanization) 


CI-1 


-.16 




-.03 




-.10 


-.03 


(Vitalicy) 


CI-2 


-.09 




-.06 




-.10 


-.06 


(Trade Center) 


CI-3 


.02 




.09 




.02 


.08 


(Wealth) 


CI-4 


.05 




.07 




.04 


.06 


(Teacher Experience) 


SM-1 




-.02 




.04 


.00 


.04 


(Conflict) 


SM-2 




.03 




.06 


.03 


.03 


(Comfort) 


SM-3 




.21 




.11 


.17 


.10 




SM-4 




.06 




.01 


.04 


.00 




SM-5 




.06 




.11 


.08 


.12 
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TABLE 6-C-5 



SUMMARY OF THE COEFFICIENTS OF 
DETERMINATION FOR THE REGRESSION ANALYSES 







Dependent Variables 




Factors 


Achievement 

SO-1 


Academic Heterogeneity 
SO-2 


Aspiration 

SO-3 


Size 


.037 


.045 


.045 


Connnunlty Input 


.105 


.010 


.035 


School Mediation 


.045 


.034 


.050 


Size and Community Input 


.118 


.066 


.061 


Size and School 
Mediation 


.065 


.055 


.075 


Community Input and 
School Mediation 


.120 


.053 


.069 


Community Input, Size, 
and School Mediation 


.130 


.090 


.088 
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Examining the first column of Table 6-C-5, it can be seen that the 
Community Input factors account for most of the variance that can be 
accounted for in SO-1 (achievement). Although size and School Mediation 
do have some effect^ most of their effect overlaps with that of Community 
Input. Community Input accounts for 10.5^ of the variance of achieve- 
ment. The addition of size and School Mediation to the equation increases 
the percentage to 13.0$. The net increase is 2.^$ which is less than 
half of the total of 6.^$ that size and School Mediation account for. 
Looking at Table 6-B-2, it can be seen that factor CI-3 (trade center) 
accounts for much of Community Input's effect. Also, Cl -4 (wealth) is 
related. It can also be seen that size does have an effect. The last 
column of Table 6-B-2 shows a regression coefficient of .16 for size in 
the equation including all the variables. 

The second column in Table 6-C-5 gives the coefficients of deter- 
mination in the prediction of SO-2 (academic heterogeneity). Maximum 
coefficient (9$) is about a third smaller than maximum coefficient (13$) 
for the previous factor. Also, Community Input is a suppressor variable 
group for both size and School Mediation. Size and Comm\mity Input 
account together for more than the sum of what they account for separately. 
Exactly which group has the greatest impact on academic heterogeneity is 
not clear. In Table 6-B-3 it can be seen that SM-5 has a high positive 
weight when only School Mediation is included, but it has a very small 
weight when other factors are included. Other reversals seem to take 
place and the regression structure is undecipherable. 

In the third column of Table 6-C-5, it is seen that 8.8$ of the 
variance is accounted for by all the factors in predicting SO-3 (aspira- 
tion). Each factor accounts for about the same amount of variance, and 
the accumulation is approximately linear. Each factor, including size, 
seems to account for some unique variation. Size and the two untitled 
School Mediation clusters seem to have the greatest effect. 



Contour -Plotting Analysis 

The correlations found in the regression analysis were disappointingly 
low and some contour -plotting checks were made to determine whether non- 
linear relations or mixture distributions were depressing the correlations. 
The contour-plotting theory and operations were explained in Section 5“D* 

In Figure 6-A-l is presented a contour plot of the joint frequency 
distribution of CI-1 (\irbanization) and CI-2 (vitality). By their con- 
struction, these factors have a correlation of zero. But it is clear 
from the plot that they do not have a bivariate normal distribution. 

This might be a mixture distribution, with a positively correlated part 
(seen in the inner lines) and a negatively correlated part (seen in the 
outer lines). If that is true, the use of the two variables as two 
independent dimensions of community character is misleading. 

A better example of a mixture distribution with confusing correla- 
tion is given in Figure 6-C-2, This is the contour plot of size (verti- 
cally with high values at the bottom) and SM-3 (comfort). There are 
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FIGURE 6-C-l 

CONTOUR PLOT OF THE JOINT FREQUENCY 
DISTRIBUTION OF FACTORS 
CI-1 AND CI-2 




FIGURE 6-C-2 

COMTOUR r’LOT OF THE JOINT FREQUENCY 
DISTRIBUTION OF SIZE AND 
FACTOR SM-3 
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clearly two trends: for small schools the trend is lower-left to upper- 

right and for large schools, the trend is lower-right to upper -left. 

These trends are contradictory and the correlation is therefore depressed. 

These two examples suffice to illustrate the utility of the countour- 
plotting approach developed during the project. 



0« Concluding Statement 



The purpose of this research was to relate the size of secondary 
schools to certain costs which ere not being considered when new edu- 
cational facilities are planned and built* These costs were measured 
in terms of educational p.rcdactivity, which in turn measured along 
a wide spectrum of outcomes ranging from academic progress to social 
behavior* 

Three long-range objectives were Initially proposed: (1) to deter- 

mine the interrelationships of ccmmunity, school, and educational output 
f.^-ctors as influenced by size of secondary school; (2) to apply these 
irterrelationsbips to a reoresentative group of educaticmal situations; 
and (3) to disseminate findings to school officials for use in making 
decisions about size of secondary schools* 

This report covers phase one of the project* The specific objec- 
tive of this phase has been to develop a suitable methodolo^ for 
subsequent applicaticn to a national sample* The population selected 
for study in phase one consisted of the public secondary schools of Iowa 
and Florida* The sample was subsequently narrowed to the high schools 
of Iowa* 

A theoretical framework was constructed, into idiich selected var- 
iables were set* Since many of these variables were not directly 
measurable, indicators were drawn from the information found in data 
banks* The data were clustered according to the theoretical framework 
and analyses were then performed* 

Five conclusions may be drawn from the work completed in phase one: 

1* It may be less costly in time and money to gather data for 
intended research uses than to attempt to convert or adapt data gathered 
for other purposes in order to generalize the solution of operational 
problems . 

2* The compiling of data on social characteristics of educational 
units (attendance or administrative) has generally been neglected by all 
levels of government, incliiding school districts* The data which are 
maintained have immediate and practical use in matters of management, 
but they are insufficient in scope and often in unsuita**le form for 
research aimed at solutions to problems of a general nature* Yet, these 
social characteristics are becoming more and more important to know 
consider in educational planning. 
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3. The cen factors extracted in this research may be useful in 
describing principal elements in the process by which community inputs 
become student outputs via school mediation. However, they probably 
do not yet provide a reasonably complete description of the social 
phenomena because of the character and coverage of the data used. 

4. Sise of secondary school was found to be significantly related 
to one factor which is itself a function of size and density, nasiely 
urbanization. 

5. The theoretical framework, as originally constructed, was 
intended to describe the relationship in its complexity. Yet, as has 
been indicated, the data were inadequate for proper testing of the 
theory and modifications had to be made in the clustering of the data. 
These modifications contributed to the reduction in value of principal 
component analysis. 

In the next phase of this research, a national sample of secondary 
schools will be drawn. Data appropriate to the indicators set in the 
theoretical framework will then be gathered by direct means and analyzed 
by the method developed in this first phase. A general answer can then 
be given to the consequences of hi^ school size 4snder investigaticm 
in this project. 
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APPENDIX 




APPENDIX A 



SCHOOL AND COMMUNITY UNITS* 



While other units of local government, particularly the special 
purpose districts, have increased steadily in numbers, the local school 
district has decreased phenomenally. Since 1932, the total number of 
school districts has declined from 127,649 to just above 25,000. The 
decline is due almost exclusively to the reorganization of rural school 
districts, most of which were operating a one-teacher elementary school 
or no school at all. In the intervening 35 years the population of the 
U.S. has increased more than 50^. The number of local governmental 
units in 1^2 by type for the U.S. and for Iowa and Florida are given 

below : 



coimties 

municipalities 

townships 

special districts, exclusive of 
school districts 
school districts, exclusive of 
2341 public school systems 
operated as part of other 
governmental units 

total 



u. s. 


Iowa 


Florida 


3043 


99 


67 


18000 


944 


366 


17142 


- 


•• 


18323 


263 


264 



34678 


1336 


67 


91186 


2642 


764 



Most of the decline since 1952 in numbers of school districts has 
been confined to ten states: Illinois, Iowa, Kansas, Michigan, Minnesota 

Mississippi, Missouri, Nebraska, New York, and Wisconsin. Moreover, 
most of the change was legislated at the state level. 

As of i960, the average area in square miles of school districts in 
middle western states was under 50 • It was over 200 in southern states, 
and over 1000 in western states. 

The relationships of school district boundaries to those of other 
units of local government are quite varied. In three states (Florida, 
Nevada, and West Virginia) the independent school units are coterminous 



*U.S. Department of Commerce, Statistical Abstract ef the ^g, . , 
89 th Edition. Washington, D.C. : Government Printing Office, 19bb. 

pp. 413-416. 




with counties. In five others (Alabama, Georgia, Kentucky. Louisiana 
and Utah) the independent school unit boundaries are generally but not 
universally coterminous with counties. In four states (Indiana* New 
Hampshire, New Jersey, and Pennsylvania) school units are coterminous with 
townships or towns. In all others, there is only occasionally a coinciden- 
tal relationship with other units of local government. Even the earlier 
identification of school districts with municipalities has been largely 
lost. 



Education may be described as a function of population. Consequeir^iy, 
density of population controls to a considerable extent the geographic 
spread of school units. For example, one school district in Nevada 
(F.lko County) exceeds by almost 3000 square miles the geographic area of 
562 school units in Connecticut, Massachusetts, and Rhode Island combined. 
The relative enrollments are 29 OO and 1.2 million. 

States may be classified according to the numbers of different 
types of school units found in each, as follows; 



one type only 


k 


two types 


13 


three types 


15 


four types 


6 


five types 


7 


six types 


3 


eight types 


2 



In some states, the classification is based on governmental units 
such as municipalities, towns, and counties. In others, it is based 
at least in part on units of population such as total population of 
the district or school census. In yet others, school grade organization 
is used to distinguish units (e.g., elementary, high school, junior 
college). In most states, as shown above, no single method is used. 

The definitions given below are useful in bringing the problem of 
mapping school districts in terms of census into proper perspective. 

The definitions are divided into two sets, one clarifying the school 
units, and the other, the census units. In general, the two sets of 
units are not geographically coterminous. 



Definitions of School Iftiits 



A. SCHOOL DISTRICT; A school district is generally a local unit of 
government, possessing quasi corporate powers established or 
empowered by state law to conduct and administer a public school 
or a system of public schools. The term includes all types of 
school districts, such as common, city, independent consolidated^ 
separate high school, union, community township, and county unit 
school districts. The term "administrative unit" is often used. 
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1. A COMMON SCHOOL DISTRICT is an autonomous local subdivision 
created only for school purposes and operated by a board or 
school wi:’icial. It usually conducts only one school. 

2. A COMMUNITY SCHOOL DISTRICT is of the same nature as a common 
school district. Its distinguishing characteristic is that it* 
comprises the territory of a whole community usually composed of 
a village, town, or city and its tributary trade and service 
are^i of smaller population centers and open country. 

3. A CITY OR INDEPENDENT SCHOOL DISTRICT is one that comprises at 
least the area of an entire city. It usually is a separate quasi 
corporation Independent of the general municipal government. 

. A TOWN OR TOWNSHIP SCHOOL DISTRICT is one that is coterminous 
with a politic il town of a New England state, or with a political 
township as in Indiana, Pennsylvania and New Jersey. Such dis- 
tricts are usually independent of the town or township government. 

5. A COUNTY SCHOOL DISTRICT-COUNTY UNIT is one that is coterminous 
with a civil county. Such districts are not a part of general 
county government and are usually largely or wholly autonomous 
in the administration of schools. Such districts are of two 
types : 

a. A COMPLETE COUNTY UNIT is a district which comprises an 
entire county; and 

b. A PARTIAL COUNTY UNIT is a district that comprises all of 

a county outside of one or more city or independent districts. 

6. A SEPARATE HIGH SCHOOL DISTRICT is one responsible only for 

administering a high school. Such a district is superimposed 
upon the territory of one or more elementary school districts 
and is usually not coterminous with any other political sub- 
division. In some instances, however, it is coterminous with a 
township as in Illinois and Wisconsin. The term HIGH SCHOOL 
includes ony type: the traditional 4-year high school, 2-year 

or 3~year senior high schools, or 6-year or 5"year high schools. 

B. ATTENDANCE UNITS: An attendance unit or school unit comprises the 

geographical area and its population served by a single school. In 
larger districts it is part of a cchool district. 



Definitions of Census Units 
A. GENERAL 

1. URBAN-RURAL— An urban population comprises all persons living in 
(a) places of 2,500 inhabitants or more incorporated as cities, 
boroughs, villages and towns; (b) the densely settled urban fringe, 
whether incorporated or unincorporated, of urban areas; (c) unin- 
corporated places of 2,500 inhabitants or more. 
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2. PLACES— Concentrations of population, regardless of the existence 
of legally prescribed limits, powers, or functions. Most of the 
places listed are incorporated as cities, towns, villages, or 
boroughs, however. The larger unincorporated place outside the 
urbanized area was delineated and those with a popi^tion of 
1,000 or more are presented in the same manner as incorporated 
places of equal size. 

3. ENUMERATION DISTRICT— A small area assigned to an enumerator 
whicn must be cc.T/assed and reported sepajotely. In most cases 
an ED contains a_jproximately 250 housing vji:-.ts. The boundaries 
for the ED*s for the most part follow such features as r^ds, 
streets, railroads, streams, and other clearly defined lines 
which may be easily identified by census e.ranerators in the field 
and often do not conform to the boundaries of political units. 

B. SPECIFIC 

1. COUKTIES — The primeiry divisions of the States except for Louisiana 
where the divisions are called parishes and in Alas ka where the 
data are collected for election districts. 

2. MINOR CIVIL DIVISIONS— The primary political divisions into which 
counties are divided. Where more than one type of primary 
division exists in a county, the bureau uses the more stable 
division so that comparable data are available from census to 
census (School, taxation, election units, etc, are not considered 
stable.). 

3. STANDARD METROPOLITAN STATISTICAL AREAS (SMSA)— A unit whose 
entire pojmlation is in and around the city whose activities fora 
an integrated social and economic system. Except in New England, 
an SMSA is a county or group of contiguous counties which contau 
at least one city of 50,000 inhabitants or more or "tw^ cities" 
with a combined population of at least 50>000. In addition to 
the counties containing such a city or cities, ccmtiguous 
counties are included in an SMSA, if. according to certain 
criteria they are essentially metropolitan in character and are 
socially and economically integrated with the central city. 
Criteria are listed on page viii of PC(l), 

4. URBAN — An urbanized area contains at least one city of 

50,CC0 j.rhabi ir.ants or more in 19^0, as well as the surrounding 
closely fettled incorporated places and unincorporated areas 
that meet the criteria listed on page vii of BULLETIN PC(l). 

5. URBAN PI -ACES — All incorporated and unincorporated places of 
2.5CO ii.luibitants or more, and the towns, townships, and counties 
classified as urban. 

6. INCOrJCRAT^lO PLACES— Places which are incorporated as cities 
boroughs, towns, and villages with the exception that towns are 
not recognized as incorporated places in the New England states, 
New York and Wisconsin. 
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7. UNINCORPORATED PLACES— Densely settled population centers without 
corporate limits. 



Census information is divided in three different ways as follows: 

A. MAJOR SUBJECT FIEIDS 

1. Population 

2. Housing 

3* Agriculture 
Business 
5 • Manufactures 
6 . Mineral Industries 
7 * Transportation 
8. Governments 

B. GEOCHiAFHIC AREAS 

1. United States 

2. Regions and Divisions 
3 • States 

U . Counties 

Standard Metropolitan Statistical Areas 
6. Minor Civil Divisions 

7< Cities and Other Incorporated and Unincorporated Places 
8. Urbanized Areas 
9* Census Tracts 

C. RETAIL TRADE AREAS 

1. Central Business Districts 

2. Major Retail Centers 

3 . City Blocas 

4. Enumeration Districts 

5» Foreign Trade Statistical Areas 
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APPENDIX B 



ILLUSTRATION OF THE PROCESS OF CONTENT ANALYSIS 



Ths followins pags of this appen':^!^^ contains the fil3 docunenta" 
tlon for loya File This foldout traces a completed data iten 

file sheet for one Iteia -• "seiaester hours”* The steps in completing 
the file sheet \jere: 

a) The unit which the item describes— teacher— was inferred 
from the file identification page since the name of the 
file is Teacher Bnployment. The unit was recorded on the 
data item file sheet and applied to all items in the file* 

4 

b) The item ”Sem Hrs" was located as the next item to be pro- 
cessed on the information layout form. The abbreviated 
title of the item was copied on the file sheet. (All other 
items in File DB-lS were also first located on this form.) 

c) The item was located on the source form. The coding system 
and the source used in its determination were recorded* 

The meaning of the ittan was generally clear -- a teacher's 
total semester hours of college credit earned. 

d) A search was made for a more precise description of the 
item. The information was found on page 5 of the Iowa 
Educational. Data Bank User's Manual. It was assumed that 
the item in the manual and the item being processed were the 
same even though the manual described the iton as having no 
coding table* A key-word title was constructed and recorded 
according to the criteria presented in Chapter U Section B. 
The complete description and source were recorded. 

e) The jdjysical information was recorded from the file 
identification j»ge. 
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APPENDIX C 



THE MAPPING PROCEDURES 



Preliminary Tests 

The Iowa State Department of Public Instruction has prepared a map 
of the state with all secondary school districts indicated in a color 
code. The state map was constructed frcrni a variety of county maps of 
more or less precision. These were photographed, and 35mm slides were 
prepared. Efforts to superimpose a minor civil di\rision map on the 
school district map were fruitless. The University of Wisconsin Geography 
Department advised that the time and money needed to achieve reasonable 
acc Piracy were prohibitive. 

Professor Randall Sale experimented with estimation techniques to 
determine if estimation time could be reduced. He found that after 
some practice, he could obtain proportional estimates not significantly 
different from those obtained by using a planlmeter . 

Another device sometimes useful was counting sections within town- 
ships. Net all the county maps displayed these sections, but for those 
which did, percentages were easily and accurately calculated. 



The remaining i>ages of this appendix comprise an instruction 
booklet which aided in carrying out the mapping task. 
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INSTRUCTION BOOKLET 



A. Problem ; 

Assign each township or part of township, town or part of town, 

census tract or part of census tract to a specific school district. 

B. Materials Needed ; 

1. State map with all school districts identified by code numbers. 

2. All county maps with school district lines clearly indicated. 

3. Maps for all SMSA's in the state. 

U. A print -out of the list of counties, townships, towns, etc. 
of the states with four dashes and slashes dividing the space 
to the right of the name. 

For example: Blair Twp. - / / - / - 

5. A table of proportional parts enabZ.ing workers to convert numbers 
quickly to percent of the township. (Most townships had 36 
sections— see Appendix C, Figure 1.) 

6. A list of all school districts both alphabetically and numerically 
arranged for cross-checking purposes. This listing should be by 
county, the counties being alphabetically arranged. (See Appen- 
dix C, Figure 2.) 

C. Personnel Needed : 

!• Staff instructor to train and supervise local people. 

2. Three pairs of workers; the first of each pair makes estimates 
of area and the second writes percents on the print -out. 

D. Process ; 



1* Divide the state into three parts geographically so that each 
pair of workers is responsible for approximately the same 
number of counties. 

2. Display the large state map indicating all minor civil divisions 
and explain hew the state is divided into townships and the 
townships into sections. 

3. Practice estimating areas on townships (see Appendix C, Figure 3), 
and then cross check the results using the table of proportional 
parts. For example, 12 sections of the 36 section township 
would be equivalent to 33^ of the township. 

U. After a couple of practice estimates verified by the staff 
instructor, supply each pair of workers with a state map on 
which the school districts are numerically listed and a county 
map with the school district lines clearly drawn and the various 
school district names indicated. 

5. Ifeve one worker do the estimating of area and the other enter 
the estimates (percents) on the census print -out. Percents are 
listed after the number of the school district. For example: 

?U6l-7(>t/^29-2O;^/:H8l7-10^/ - 

- 189 - 




No. of No. of 

Sections ^ Section s %. 



1 


3 


2 


6 


3 


8 


4 


11 


5 


14 


6 


17 


7 


19 


8 


22 


9 


25 


10 


28 


11 


31 


12 


33 


13 


36 


14 


39 


15 


42 


16 


44 


17 


47 


18 


50 



Appendix C, 



19 


53 


20 


56 


21 


58 


22 


6l 


23 


64 


24 


67 


25 


69 


26 


72 


27 


75 


28 


78 


29 


81 


30 


83 


31 


86 


32 


89 


33 


92 


34 


94 


35 


97 


36 


100 



Figure 1 



Table ^f Porportional Parts 



2 k 1845 Dow City-Arion Comm Sch Dist 
2 k 3996 Manilla Conan School Dist 
2 k 5832 Schleswig Comm Sch Dist 

Delaware 

28 1^9 Edgewood Colesburg Comm Sch Dist 
28 4 bi »3 Maquoketa Valley Comm Sch Dist 
28 6950 West Delaware Comm Sch Dist 
28 9025 Bremen Twp. Sch Dist 

Dickinson 

30 342 Arnolds Park Cons Sch Dist 
30 2846 Harris Lake Park Comm Sch Dist 
30 4284 Milford Ccanm School Dist 
30 6120 Spirit Lake Comm Sch Dist 
30 6345 Terril Comm School Dist 

Guthrie 

39 522 Bayard Community Sch Dist 
Kossuth 

55 126 Algona Comm School Dist 
55 900 Burt Community Sch Dist 
55 3456 Dakota Cons School Dist 
55 3573 Ledyard Comm School Dist 
55 389^’ Lu Verne Comm Sch Dist 
55 5868 Sentral Comm School Dist 
55 6309 Swea City Comm Sch Dist 
55 6417 Titonka Cons School Dist 
55 9022 Bancroft Cons School Dist 
55 9026 Greenwood Twp Sch Dist 
55 904l Ramsey Rur Ind Sch Dist 
55 9047 Grant Cons School Dist 



l 4 9057 Boise Rut Ind Sch Dist 
14 9058 Lincoln Rur Ind Sch Dist 
l 4 9059 Granville Rur Ind Sch Dist 
14 9060 Sheridan Rur Ind Sch Dist 
l 4 9062 Buck Run Rur Ind Scii Dist 
l 4 9063 Prairieville Rur Ind S D 
l 4 9064 Storm Creek Rur Ind S D 

Dubuque 

31 1^3 Dubuque Comm School Dist 

31 6961 Western Dubuque Comm Sch Diflb 

Emmet 

32 333 Armstrong Comm Sch Dist 
32 2l44 Estherville Comm Sch Dist 

32 3700 Lincoln Central Comm Sch Dist 
32 5544 Ringsted Comm School Dist 

Jackson 

49 243 Andrew Comm School Dist 
49 585 Bellevue Comm Sch Dist 

49 404l Jfequoketa Comm Sch Dist 
49 4275 Miles Comm School Dist 
^9 5337 Preston Comm School Dist 
49 5733 Sabula Comm School Dist 

Palo Alto 

74 450 Ayrshire Cons Sch Dist 

74 2088 Emmetsburg Comm Sch Dist 
(Cylinder Joined) 

74 2556 Graettinger Comm Sch Dist 
74 3969 Mallard Comm School Dist 
74 5724 Ruthven Cons School Dist 
74 6921 West Bend Comm Sch Dist 



Appendix C, Figure 2 
List of School Districts 
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APPENDIX C, FIGURE 3 
Practice for areal esti . cting, 
superimposing map elements upon a hypothetical county, 



- 192 - 




miiii 



Some parts of Iowa townships wore not. Joined to a secondary 
school district at tne tisie of the study so that the total 
percent was not always 100^. 

6. Have e€u:h pair proceed county l?y county until they complete 
their geographical area- 

7* SMSA's were divided by percent of census tract. 



Transformation of the Census Data 

Except for Case IV (see Table U-C-2), the proportional correspondences 
of the disconsonant units had been determined throu^ mapping of area. 
Enough information had been gathered to verify that a particular district 
derives its population from 25^ of one township plus 100^ of another, 
etc. The single district in Case IV had two hi^ schools. Each atten- 
dance area received an undifferentiated allocation of census data in 
proportion to its area. 

All the information was present. In fact, more Information had been 
gathered than could be used. Basically, two lists were present: the 
first was a list of cuitual census data units, units for which data were 
stored on magnetic tape; the second was a list of all the geographic 
areas in Iowa, areas which had been assigned to school districts. 
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INDICATOR DEFINITIOllS , INTEROQRREIATIONS , AND 11EANS: 
THE 48 STATE SORVET 



A rwwihoi- of sources were used to obtsin data for the 48 state 
survey. Demographic and general educational data were obtained 
from Statistical Abstract of the U^S , and to a lesser extent from 
the P,S> Book of Facts Statistics and Information , For more detailed 
educational data two types of references were used. The first included 
two government publications on educational statistics* Public Secondary 
Schools Nmri>er 1 and the Digest of Educational Statistics , The second 
was Ranking of the States published by the National Educational 
Association, Other references were used to a more minor extent, 
are listed the sources used. 
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Construct Teacher Quality 



Indicators 

1. Average secondary school 
teacher's salary. 

2. Percent of male teachers. 



3. Supervisors per pupil. 



4. Guidance counselors 
per pupil. 



5* Pupil teacher ratio. 



Definition 



Source 



The average salary per 
public secondary school 
teacher by state* 1961. 

Percent of public secondary 
school teachers uho were 
men by state* 1959. 

The nuod>er of department 
heads and supervisors per 
public secondary school 
pupil multiplied by 1000 
per state* 1959. 

The nuad>er of public secon* 
dary school guidance coun* 
selors per pupil imiltiplied 
1^ 1000 per state* 1959. 

The number of public secon* 
dary school pupils per public 
secondary school teacher per 
state* 1959. 



SA* US2* p. 129 



PSS* 1961* p. 44-5 
PSS* 1961* p. 44-5 



PSS* 1961* p. 44-5 



PSS* 1961* p. 16 



6. Librarians per secondary 
school pupil. 



The nund>er of public secon- PSS* 1961* p. 44-5 
dary school librarians per 
secondary school pupil 
multiplied by 1000 per state* 







1959. 






Ptiixipal 




Standard 


lotfa 


Florida 




Component 


Means 


Deviations 


Value 


Value 




Loadina 


5175.062500 


770.807677 


5243.00 


5025.00 




464 • 


52.543750 


7.857160 


58.30 


47.80 




462 


.707292 


.505296 


0.3^^ 


0.79 




4-76 


1.570208 


.553521 


1.49 


0.98 




464 


22.266667 


2.362761 


18.80 


24.40 




-42 


1.545625 


.409191 


1.31 


1.40 




-64 






Intercorrelations 








1 


2 3 


4 5 


6 






1 100 


61 55 


54 -16 


-49 






2 61 


100 11 


44 -44 


-10 






3 55 


11 100 


62 “13 


-62 






4 54 


44 62 


100 -46 


-^1 






5 -IS 


-44 -13 


-46 100 


-11 






6 -49 


-10 -62 


-41 -11 


100 
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Construct School Size 



Indicators Definition Source 

!• Percent enrollment in small The percent of public sec- PSS, 1961, p« 12*13 
secondary schools* ondary school pupils 

enrolled In schools of 
1-499 pupils by state, 

1959. 



2. Percent enrollment in med- 
ium sized secondary schools. 



3. Change In number of secon- 
dary schools. 



4. Additional classrooms 
needed. 



5. Excess public secondary 
school pupils. 



The percent of public sec- PSS, 1961, p. 12-13 
ondary school pupils en- 
rolled In schools of 300- 
1499 pupils by state, 1959. 

The ch^ige In the number SA, 1953, p. 119, 

of publ : secondary schools 

1950 to 1960 expressed as 

the ratio of the 1960 figyae. 

to the 1950 figure per 

state. 

Number of additional DES, 1964, p. 40, 

classrooms needed to re- 
duce the public secondary 
school class size to 30 
secondary school pupils. 

This figure Is divided by 
total public secondary 
school pupils and multi- 
plied by loco per state, 

1262. 

The number of public aec- SA, 1962, p. 124 
ondary school pupils In 
excess of normal plant 
capacity divided by the 
total public secondary 
school pupils and multi- 
plied by 100 per state, 1961. 



Standard Iowa Florida 



Means Deviation s 

45.664583 18.190101 

38.872917 11.222387 

.069167 .213472 

.306875 2.086977 

.052083 3.863126 



1 

2 

3 

4 

5 





Value 




Value 






63.00 




26.10 






32.10 




45.10 






Oi.78 




1.11 






1.99 




7.17 






3.50 




9.10 






Intercorrelations 




1 


2 


3 


4 


5 


100 


-76 


-44 


-06 


-01 


-76 


100 


37 


-18 


01 




37 


100 


-03 


04 


-06 


-18 


-03 


100 


13 


-Cl 


01 


04 


13 


100 



Principal 

Component 

Loading 



-90 

•i«9 

*^68 

-11 

-W3 
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Construct State Control 



Indicators 



Definition 



Source 



1. Number of state-required 
courses . 



Ntaber of new courses (as CR, I960, p« 13 
of Dec. 1955) required with 
content and methods subject 
to state or local authority. 



2. Number of state-recom* 
□ended courses. 



Number of new courses (as CR, I960, p. 13 
of Dec.y 1955) recomnended 
by state or local author* 
ities. 



3. Number of state-guided 
courses. 



Number of new courses (as CR, 1960, p. 13 
of Dec., 1955) either 
required or recommended by 
state authorities. 



Means 


Standard 

Deviations 


Iowa 

Value 


Florida 

Value 


Principal 

Component 

Loading 


2.304167 


2.497829 


O.C 


4.0 


+62 


13.958333 


5.058597 


9.0 


19.0 


+79 


6.479167 


4.949704 


0.0 


17.0 





Intercorrelations 



1 


100 


23 


29 


2 


23 


100 


48 


3 


29 


48 


100 
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Indicators 

1. Percent state funds. 



2« Percent federal funds. 



3. Per*caplta state edu* 
catlonal expenditure. 

4. Percent spent on 
education. 



5. Percent spent on 
public welfare. 



3. State educational 

expenditure per pupil. 



Construct Funding 
Definition 

Percent of revenue receipts for 
public elementary and secondary 
schools from state sources per 
state, 1961*2. 

Percent of revenue receipts for 
public elementary and secondary 
schools from federal sources per 
state, 1961*2. 

The total state expenditure on 
lower education divided by the 
total state population, 1960. 

Direct general expenditure of 
state and local government by 
functlon*percent spent on edu* 
cation per state, 1960. 

Direct general expenditure of 
state and local government by 
functlon*percent spent on 
public welfare per state, 1960. 

The total state expenditure on 
lower education divided by the 
total public school pupils (ele* 
mentary and secondary) per state, 
1960. 



Source 

ms, 1964, p. 58*9 



DES, 1964, p. 58*9 



SA, 1962, p. 12, 
p. 113 

SA, 1962, p. 423 



SA, 1962, p. 423 



SA, 1962, p. 113 



1-leans 


Standard 

Deviations 


Iowa 

Value 


Florida 

Value 




Principal 

Component 

Loadlne 


39.537500 


18.191042 


11.30 


52.20 




*42 


5.608333 


2.207264 


3.50 


5.70 




*70 


69.958750 


13.935488 


77.17 


58.10 




83 


36.845833 


4.136220 


37.00 


32.60 




08 


8.522917 


3.104247 


8.20 


6.30 




*45 


365.083333 


83.343624 


400.00 


310.00 




93 








Inter correlat Ions 










1 


2 3 


4 5 


6 






1 


100 


26 *14 


26 07 


*32 






2 


26 


100 *32 


09 18 


*59 






3 


*14 


*32 100 


31 *28 


81 






4 


25 


09 31 


100 *20 


*06 






5 


07 


18 *28 


*20 100 


*25 






6 


*32 


*59 81 


*06 *25 


100 
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Construct Quantitative Population Change 



Indicators 



Definition 



1« Population increase. 



Percent change in U«S. 
.population by state, 
1950-60. 



2. Population mobility. 



Percent of 19 SO population 
who resided in a different 
county (or abroad) from 
their 1955 residence. 



3. Birthrate. 



Tli3 nuuber of 
live births per 1000 popu- 
lation for each state, 
1960. 



4. 



Population under 15 
years. 



The number in thousands of 
persons under 15 years per 
state, 1960. 



5. 



Change in public 
secondary school 
pupils. 



The 1963 figure for pulrllc 
secondary school pupils 
divided by the 1950 figure 
per state. 
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me. 



Source 

USB, 1966, p. 13 



USB, 1966, p. 33 



SA, 1961, p. 50 






USB, 1966, p. 23 



SA, 1953, p. 120, 
1962, p. 125 



6. Percent of population 


The percent of 


the popu- 


USB, 1 


under 15 


years old. 


lation under 15 years by 


p. 23 






state, 1960. 




Principal 




Standard 


Iowa 


Florida 


Component 


lleans 


Deviations 


Value 


Value 


Loadine 


18.910417 


18.731983 


5.20 


78.70 


+84 


17,681250 


5.638139 


14.50 


31.00 


+62 


24.408333 


2.180^^85 


23.30 


23.30 


+70 


1151.916667 


1106.634516 


858,00 


1467.00 


-22 


1.510417 


.288241 


1.24 


2.31 


+65 


31.906250 


2.239829 


31.10 


29.30 


-h48 






Inter corre la t ions 






1 100 


73 32 09 


90 01 






2 73 


IOC 35 -20 


66 15 






3 32 


35 100 -28 


35 85 






4 09 


-20 -28 100 


01 -29 






5 90 


66 35 01 


100 12 






6 01 


15 85 -29 


12 IOC 














Construct Qualitative Population Change 



Indicators 



!• Net migration white 
population. 



2. Net migration non* 
white population. 



3. Percent Negro. 



4. Change in urbanization, 



Definition 

The migration from abroad, 
interdivisional, interregional, 
and interstate of U.S. white 
population, 1950*60. 

The migration from abroad, 
interd ivisional , interregional , 
and interstate of U.S. non* 
white population, 1950*60. 

The percent of U.S. population 
who are Negro, by state, I960* 

The change in urbanization 
between 1950 and 1960 per 
state. 



Source 



USB, p. 34 



USB, 1966, p. 34 



USB, 1966, p. 27 
USB, 1966, p. 16 



5. Percent urbanization. 



The percent of the 
state's population llv* 
ing in urban areas (2500 or 
more people), 1960. 



USB, 1966, p. 16 



Percent engaged in 
manufacturing. 



Percent of technical 
and professional 
workers. 



Percent of employees in non* 
agricultural establishments 
working in manufacturing, 1960, 

The percent of the state's 
population who were technical 
or professional workers, 1960. 



SA, 1962, p. 224 



RS, 1966, p. 59 



Principal 





Standard 


Iowa 




Florida 


Component 


Means 


Deviations 


Value 




Value 


Load ins 


2.139583 


17.315982 


-9.10 




70.00 




+39 


12.566667 


32.375823 


12.30 




15.60 




-i«l 


8.910417 


10.479330 


0.90 




17.80 




-62 


7.008333 


5.636408 


5.40 




8.50 




*51 


60.993750 


15.417553 


53.00 




73.90 




•^80 


28.531250 


14.292352 


26.00 




15.60 




-08 


11.304167 


3.810400 


9.70 




10.30 




-09 






Intercorr elat ions 








1 100 


32 01 


25 


44 


*07 


25 




2 32 100 -45 


-33 


46 


25 


14 




3 01 -45 IOC 


19 


-28 


26 


*11 




4 29 *; 


33 19 


100 


-31 


01 


60 




5 i'A 46 *23 


-31 


100 


*08 


-04 




6 *07 : 


25 26 


01 


-08 


100 


49 




7 25 


L4 *11 


60 


-04 


49 


100 
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Construct Social Responsibility 



Indicators 



Definition 



1. Percent voting. 

2. Percent covered by 
hospital insurance. 

3. Expenditure for parks. 

4. Amount raised by 
coDomunity chest. 

5. Membership in AFL-CIO, 



Percent of voting age popu- 
lation who voted for president 
in i960 by state. 


USB, 1966, p. 381 


Percent of i960 population 
covered by hospital insurance 
by state. 


SA, 1962, p. 479 


The current operating expendi- 
ture for municipal and county 
parks i960 calculated on a per 
capita basis by state, i960. 


SA, 1962 j p. 12, 

p. 205 


The amount raised by community 
chest and united fund campaigns 
i960 calculated on a per capita 
basis by state, I960. 


SA, 1962, p. 12, 
p. 306 


The membership in AFL-CIO 
per state calculated on a 
per capita basis and 
multiplied by 100, i960. 


SA, 1962, p. 12, 

1963, p. 249 



Means 


Standard 

Deviations 


Iowa 

Value 


Florida 

Value 


Principal 

Component 

Loading 


64.093750 


15.322919 


76.50 


50.00 


464 


68.502083 


11.837396 


59.80 


66.80 


+89 


1.5^^2083 


.975923 


0.99 


2.74 


+73 


2.0' 3? 25 


.987612 


2.12 


1.80 


+78 


5.969583 


3.205804 


4.89 


3.00 


+87 



Intercorrelat ions 



1 


100 


58 


29 


27 


44 


2 


58 


100 


49 


69 


66 


3 


29 


49 


100 


42 


66 


4 


27 


69 


42 


100 


60 


5 


44 


66 


66 


60 


100 
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Construe* Edur:,tional Attainment 



Indicators 

1. Percent with 4 years of 
high school. 



2. Percent with 4 years of 
college . 



3. Median school years. 



4. Percent illiterate. 



5. Number of doctors. 



6. Number of psychologists. 



7. Percent of professional 
and technical workers. 



8. High school graduates. 



Definition Source 

The number of individuals DBS, 1964. p. 125 

25 years old and older who 

have ccmipleted 4 or more. 

years of hi^ school I960 

divided by the total state 

population 25 yesurs and 

over, multiplied by 100, 

i960. 



The number of individuals DBS, 1964 p. 125 

25 years old and older who 

have completed 4 or more 

years of college i960 divided 

by the total state population 

25 years and over, multiplied 

by 100, i960. 



The median number of school RS, 1966 p. 25 
years completed by 
individuals 25 years and 
over, i960. 

The percent of each state's RS, I966, p. 25 
population l4 years old and 
older who were illiterate, 

i960. 



The number of doctors per SA, 1962, p. 74 
state per 100,000 popula- 
tion, i960. 

The number of psychologists APA, I961, pp. 806- 
per state per 100,000 popu- 943 
lation, i960. 



The percent of the state's RS, 1966, p. 59 
population who were tech- 
nical or professional 
workers, i960. 

Public high school grad- SSS, 1959^ PP- 56-7 
uates 1959 as a percent of 
1955-6 ninth grade enroll- 
ment . 
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Means 


Standard 

Deviations 


Iowa 

Value 


Florida 

Value 


Principal 

Component 

Loading 


41.170833 


7.165047 


46.30 


30.60 


489 


7.443750 


1.518484 


6 . 4 o 


7.40 


+82 


10.552083 


1.093540 


11.30 


10.90 


+90 


2.252083 


1.386541 


0.70 


2.60 


69 


119.250000 


28.442559 


117.00 


131.00 


+71 


8.035417 


4.258300 


110 


7.60 


+77 


10,856250 


1.649609 


9.70 


10.30 


-^65 


67.012500 


6.846947 


76.50 


65.60 


+59 



Inter correlations 



1 


100 


70 


96 


-72 


4l 


44 


68 


59 


2 


70 


100 


71 


-23 


56 


66 


90 


19 


3 


96 


71 


100 


-71 


47 


49 


69 


55 


4 


-72 


-23 


-71 


100 


-33 


-34 


-35 


-74 


5 


4l 


56 


47 


-33 


100 


86 


60 


24 


6 


44 


66 


49 


-34 


86 


100 


72 


31 


7 


68 


90 


69 


-35 


60 


72 


100 


25 


8 


59 


19 


55 


-74 


24 


31 


25 


100 
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Construct Ability to Pay 



Indicators 



1. Per-capita income. 

2. Change in per-capita 
income. 



3» Poverty. 



4. Percent of non -public 
secondary schools. 



5. Change in non -public 
secondary school 



Definition 


Source 


The per-capita income i960 
determined for each state 


SA, 1962, p 


The percent change in per- 


SA, 1962, p 


capita income between 1950 
and i960 per state. 


% 


Percent of population with 
income under $2000 per state ^ 

1959. 


SA, 1962, p 


Percent of total school 


SA, 1963, P 


enrollment i960 who are 
enrolled in non-public 
schools per state. 


p. 131 


The percent of all secon- 


SA, 1953, P 


dary school pupils attending 


p. 133, 1963 


non-public schools i960 
divided by the comparable 
figure for 1950 per state. 


p. 131 



Means 


Standard 

Deviations 


Iowa 

Value 


Florida 

Value 


Principal 

Component 

Loading 


2064.645833 


423.093542 


2003.0 


1988.0 


+92 


77.854167 


29.238238 


46.0 


174.0 


+24 


14.641667 


7.359966 


15.2 


16.2 


-93 


10.172917 


6.402107 


11.0 


10.5 


+71 


1.127083 


.272748 


1.1 


1.9 


-08 



Intercorrelat ions 



1 


100 


26 


-86 


46 


04 


2 


26 


100 


-19 


-10 


42 


3 


-86 


-19 


100 


-53 


-04 


4 


46 


-10 


-53 


100 


-40 


5 


04 


42 


-04 


-40 


100 
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RETRIEVAL TERMS 



iOEN TIFiERS 



I ELEMENTS OF EDUCATIONAL PRODUCTIVITY WERE IDENTIFIED WHICH ARE 
INFLUENCED BY SIZE OF SECONDARY SCHOOLS. PRODUCTIVITY INCLUDES 
OUTCOMES SUCH AS ACADEMIC PROGRESS AND SOCIAL BEHAVIOR AND EXCLUDES 
ECONOMIC OUTCOMES SUCH AS GAINS IN INCOME. THE RESEARCH IN THIS 
FIRST PHASE WAS DONE WITH THE HIGH SCHOOLS OF IOWA. DATA STORED WITH 
THE IOWA EDUCATIONAL INFORMATION CENTER AND THE U.S. BUREAU OF THE 
CENSUS WERE USED. CLASSES OF INDICATORS WERE DEVELOPED AND ANALYZED 
STATISTICALLY. STEPS WERE THEN TAKEN TO (1) DEFINE SUB-CLASSES OF 
HOMOGENEOUS INDICATORS, (2) PERFORM PRINCIPAL COMPONENTS ANALYSIS ON 
EACH SUB-CLASS TO CHECK HOMOGENEITY, REMOVE AMBIGUITY, AND COMPUTE 
COMPONEN’T SCORES, (3) PERFORM IMAGE ANALYSIS ON THE COMPONENTS 
WITHIN EACH CLASS OF INDICATORS AND COMPUTE IMAGE FACTOR SCORES, (4) 
PERFORM REGRESSION ANALYSIS OF IMAGE FACTORS FOR EACH CLASS OF 
INDICATORS, USING PRODUCTIVITY AS THE DEPENDENT VARIABLE, AND (5) 
INTERPRET SIGNIFICANT INTERACTION REGRESSION COEFFICIENTS. TEN IMAGE 
FACTORS WERE EXTRACTED, BUT THE SPARSITY OF DATA ON SOCIAL 
CHARACTERISTICS OF SCHOOL DISTRICTS REDUCED THE VALIDITY OF THE 
ANALYSIS. A PRINCIPAL CONCLUSION OF THIS PHASE IS THAT DATA DESIGNED 
AND COLLECTED FOR ADMINISTRATIVE PURPOSES CAN BE AS COSTLY AND TIME 
CONSUMING TO CONVERT TO RESEARCH USES AS THE SAMPLING OF POPULATIONS 
AND GATHERING OF ORIGINAL DATA. 
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